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OB PROTEIN RECEPTOR AND RELATED COMPOSITIONS AND METHODS 
FTRT.D OF THE 'INVENT TON 

5 The present invention relates to OB protein 

receptors, related compositions and methods of making 
and using such receptors and related compositions. 

BACKGROUND 

10 Although the molecular basis for obesity is 

largely unknown, the identification of the "OB gene" and 
protein encoded ("OB protein") has shed some light on 
mechanisms the body uses to regulate body fat deposi- 
tion. Zhang et al., Nature 372 : 425-432 (1994); see 

15 also , the Correction at Nature 374 : 479 (1995). The OB 
protein is active ±xi vivo in both oh/ oh mutant mice 
(mice obese due to a defect in the production of the OB 
gene product) as well as in normal, wild type mice. The 
biological activity manifests itself in, among other 

20 things, weight loss. See generally , Barinaga, "Obese" 

Protein Slims Mice, Science 269 : 475-476 (1995). See PCT 
International Publication Number WO 96/05309, 
"Modulators of Body Weight, Corresponding Nucleic Acids 
and Proteins, and Diagnostic and Therapeutic Uses 

25 Thereof," herein incorporated by reference. 

The other biological effects of OB protein are 
not well characterized. It is known, for instance, that 
in oh/oh mutant mice, administration of OB protein 
results in a decrease in serum insulin levels, and serum 

30 glucose levels. It is also known that administration of 
OB protein results in a decrease in body fat. This was. 
observed in both ob/ob mutant mice, as well as non-obese 
normal mice. Pelleymounter et al., Science 269 : 540-543 
(1995); Halaas et al . , Science 269 : 543-546 (1995). See 



i 
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also . Campfield et al., Science 269 : 546-549 (1995) 
(Peripheral and central administration of microgram 
doses of OB protein reduced food intake and body weight 
of oh/oh and diet-induced obese mice but not in dh/db 
5 obese mice.) In none of these reports have toxicities 
been observed, even at the highest doses . . . 

Despite the promise of clinical application 
of the OB protein, the mode of action of the OB protein 
in vivo is not clearly elucidated, in part due to the 

10 absence of information on the OB receptor. High affinity 
binding of the OB protein has been detected in the rat 
hypothalamus, reportedly indicating OB receptor loca- 
tion. Stephens et al.. Nature 277: 530-532 (1995). The 
db/db mouse displays the identical phenotype as the 

15 oh/oh mouse, i.e ., extreme obesity and Type II diabetes; 
this phenotype is thought to be due to a defective OB 
receptor, particularly since db/db mice fail to respond 
to OB protein administration. Sea .Stephens et al., 
supra . 

20 Identification of the OB protein receptor is 

key in determining the pathway of signal transduction. 
Moreover, identification of the OB protein receptor 
would provide powerful application in diagnostic uses, 
for example, to determine if individuals would benefit 

25 from OB protein therapy... Furthermore, the OB receptor 
could be a key component in an assay for determining 
additional molecules which bind to the receptor and 
result in desired biological activity. Further, such 
soluble receptor could enhance or alter the effective- 

30 ness of OB protein (or analog or derivative thereof) . 

SUMMARY O F THE INVENTION 

The present invention relates to a novel class 
of protein receptors, herein denominated "OB protein 
35 receptors" or "OB receptors", which are thought to 
selectively bind OB protein. As such, the novel OB 
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receptor family is provided, as well as novel members of 
such family. Also provided are nucleic acids, vectors 
and host cells containing such nucleic acids, related 
antisense nucleic acids, molecules which selectively 
5 bind to the OB protein receptor, and related composi- 
tions of matter, such as OB receptor protein/OB protein 
complexes. In other aspects, the present invention 
relates to methods of using the above compositions, such 
as therapeutic and/or diagnostic methods, and methods 
10 for preparing OB receptor ligands. 

DETAILED DESCRIPTION 

A novel family of OB receptors is provided. 
This novel family resulted from identification of a PCR 

15 fragment isolated from a human liver cell cDNA library . 
The original PCR fragment, from which primers were 
isolated, contained a "WSXWS" motif, common to cytokine 
receptors. As illustrated by the working examples 
below, using this fragment four members of this OB 

20 protein receptor family have been identified. These 

members, herein designated as "A", "B", and "C", and "D" 
are indentical at amino acid position 1-8 91 (using the 
numbering of Seq. ID No. 1), but diverge at position 892 
through the C-terminus . They vary in length at the 

25 C-terminus beyond amino acid 891, and the different 
forms appear to have different tissue distribution. 

Using hydrophobicity analysis, the leader 
sequence is likely to comprise amino acids (Seq. ID. 
No. 1) 1-21, 1-22, or 1-28. The first amino acid of the 

30 mature protein is likely to be 22 (F) , 23 (N) or 29 (T) . 
Most likely, based on analysis of eucaryotic cell 
expression (CHO cell expression see Example 8, infra ) . 
the first amino acid of the mature protein is 22 (F) . 
The beginning of the transmembrane domain appears to be 

35 located at position 840 (A) or 842 (L) . The end of the 
transmembrane domain appears to be located at position 
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862 (I), 863 (S) or 864 (H) . Thus, based on predictions 
from hydrophobicity analysis, for OB protein binding, at 
a minimum what is needed is the extracellular domain of 
the mature protein, amino acids 22, 23 or 29 through 
5 amino acids 839 (D) or 841 (G) . Therefore, the present 
class of OB receptor proteins includes those having 
amino acids (according to Seq. ID No. 1) : 

(a) 1-896; 

(b) 22-896; 
10 (c) 23-896; 

(d) 29-896; 

(e) 1-839; 

(f) 22-839; 
(h) 1-841; 

15 (i) 22-841; 

(j) 23-841; 

<k) 29-841; 

(1) 1-891; 

(m) 22-891, 
20 (n) 23-891, 

(o) 29-891, 

(p) the amino acids of subparts (1) 
through (o) having the C-terminal amino acids selected 
from among: 

25 (i) OB receptor B (Seq. ID No. 3) 

positions 892-904; 

(ii) OB receptor C (Seq. ID No. 5) 
positions 892- 958; and, 

(iii) OB receptor D (Seq. ID No. 7) 
30 positions 892-1165; 

(q) amino acids of subparts b, c, 

d, f, g, i, j, k, m, n, o, and any of (p) lacking a 
leader sequence, which have "an N-terminal methionyl 
residue. 

35 Also provided herein is what is thought to be 

a human splice variant of a soluble OB receptor. This 
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splice variant includes the extracellular domain at 
least up to amino acid 798 (of Seq. ID No. 1, for 
example) and has a unique 6 amino acid C-terminus at 
positions 799-804: G K F T I L. 
5 The functional domains of the OB receptor may 

be predicted using the information contained in Bazan et 
al., PNAS-USA £7: 6934-6938 (1990) (incorporated herein 
by reference) . For the present OB receptor, there are 
two hematopoietin domains, a random coil region, the 
10 transmembrane domain, and the intracellular domain* The 
overall geography may be illustrated as follows: 




Box2 Box3 



i j 

Using the information provided by Bazan, 
15 supra , the domains may be predicted, with essentially an 
error of approximately plus or minus three base pairs 
(as applied to all amino acid location specified for 
purposes of identifying the Bazan predicted domains) * 
The precise locations may be determined empirically by 
20 methods known in the art, such as preparing and 

expressing modified recombinant DNAs. The structural 
characteristics are though to be important for 
maintaining the structural integrity of the molecule, 
and therefore, to the extent that such structure is 
25 important for function, for functional characteristics 
as well. 

The hematopoietin domains (HI and H2) are 
thought to have two fibronectin type 3 repeats each, one 
set of paired cysteine residues each (thought to form a 
30 disulfide bridge) , and one "WSXWS box" (referring to the 
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single letter amino acid abbreviation, with "X" being 
any amino acid) . The fibrinectin type 3 domains may be 
identified by location of a double proline ("PP") , which 
marks the beginning of the second fibronectin type 3 
5 repeat; the actual beginning of such second fibronectin 
type 3 repeat is likely to begin about 3 amino acids 
upstream of that double proline. 

The first hematopoietin domain is likely to 
begin at amino acid 123 (using the numbering according 

10 to Seq. ID No. 1, for example), which is an isoleucine 
residue (I) . The last amino acid of the hematopoietin 
domain is likely to be amino acid 339, which is a lysine 
(K) residue. The two fibronectin type 3 repeats are 
likely to be located at (about) amino acids 123 through 

15 235 and 236 through 339. There is a single pair of 

cysteine residues which likely form a disulfide bridge, 
located at position 131 and position 142. The "WSXWS 
box" is located at position 319 through 323. 

The second hematopoietin domain is likely to 

20 begin at position 428, which is an isoleucine (I) and 

end at position 642 which is a glycine (G) . The paired 
fibronectin type 3 repeats are located at about position 
428 through position 535 and about position 536 through 
about position 642. One pair of cysteines is located at 

25 position 43 6 and position 447, and the second pair is 
located at position 473 and 488. The "WSXWS box" is 
located at position 622-626. 

Between the first and the second hematopoietin 
domain (amino acids 339-428, approximately) is a region 

30 of unknown functional significance. 

The random coil domain ("RC" between the H2 
and the transmembrane domain, "TM") is likely to begin 
at the amino acid following the end of the second 
hematopoietin domain, and is likely to end at the 

35 beginning of the transmembrane domain. This is likely 
to be from about amino acid 642 through amino acid 83 9 
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or 841 (with the transmembrane domain beginning at 
position 840 (A) or 842 (L) ) . The intracellular domain 
(~IC") is likely to begin at position 861 (L) , 862 (I), 
863 (S) or 864 (H) . 
5 The intracellular domain ("IC") contains three 

regions, or "boxes," thought to participate in signal 
transduction (two "JAK" boxes and a single "STAT " box, 
"Box 1", "Box 2", and "Box 3") • With respect to the 
numbering of the amino acid positions of the "D" form of 

10 the OB receptor (Seq. ID No. 7, below), box 1 is located 
at amino acid 871 (F) through 878 (P) . Box 2 is located 
at approximately amino acid number 921 (I) through 
931 (K) . Box 3 on the "D" form is located at approxi- 
mately position 1141 through 1144 (amino acids YMPQ, as 

15 the "STAT" box is typically a conserved region of "YXXQ" 
wherein "X" designates any amino acid) . The intracel- 
lular domain is thought to be responsible for signal 
transduction. One possible mode of action is via 
phosphorylation of various residues. See Ihle et al., 

20 Cell M: 331-334 (1996) (Review article, herein 
incorporated by reference.) 

One possible mode of action is that upon 
ligand binding (here, OB protein binding) , the OB 
receptor dimerizes with another receptor. A kinase 

25 ("JAK") binds to box 1, and becomes phosphorylated. 

(The JAK may already be bound prior to dimerization. ) 
Also, "STATS" bind to box 3 and become phosphorylated on 
a specific tyrosine. It is thought that this 
phosphorylation results, probably indirectly, in DNA 

30 binding protein production, which results in altered DNA 
transcription, and therefore altered expression. As seen 
below in Example 6, one measurement of the capability of 
an OB receptor to transduce signal is the degree of 
phosphorylation of JAK/STAT molecules. 

35 The C-terminus region is intracellular (of 

cell-bound OB receptor) . The differences in the C- 
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terminus among members of the present OB receptor family 
may result in differences in signal transduction among 
the species. Thus, the present OB receptors include at 
least the extracellular domain which is important for OB 
5 protein ligand binding. Nucleic acids encoding the 

present OB receptors, vectors, and host cells are also 
provided for herein. 

The extracellular domain may be modified and 
still retain the function of ligand binding, 

10 particularly by one or more of the following 

modifications: (a) the random coil domain (as indicated 
above, occuring downstream of the second hematopoietic 
domain through the beginning of the transmembrane 
domain) may be deleted (this may be approximately 

15 positions 642 through 839 or 841); (b) the "WSXWS" box 
may be modified by (i) substitution of the first serine 
with another amino acid, particularly conserved in terms 
of hydrophobicity and/or charge, such as a glycine; (ii) 
the last serine may be substituted with another amino 

20 acid, such as a threonine; (iii) the first tryptophan 

may be substituted with another amino acid, for example, 
a tyrosine. 

Human genomic DNA encoding OB receptor protein 
is also provided herein. The genomic DNA has been 

25 localized to human chromosome 1P31, which is believed to 
correspond to mouse chromosome 4, the location of the 
mouse db locus. 

Tissue distribution analysis demonstrates the 
presence of OB receptor nucleic acids is fairly 

30 ubiquitous, and particularly noted in the liver. It is 
also observed in the ovary, and heart; and, to a lesser 
extent, in small intestine, lung, skeletal muscle, 
kidney, and, to an even lesser extent, spleen, thymus, 
prostate, testes, placenta and pancreas (Example 2, 

35 below) . There may also be one or more forms of the OB 
receptor present in serum, such as soluble OB receptor, 
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which may be complexed to one or more forms of the OB 
protein. 

Amino Acid Sequences and Compositions 
5 According to the present invention, novel OB 

protein receptors and DNA sequences coding for all or 
part of such OB receptors are provided. The present 
invention provides purified and isolated polypeptide 
products having part or all of the primary structural 

10 conformation (i.e., continuous sequence of amino acid 
residues) and one or more of the biological properties 
(e.g., immunological properties and in vitro biological 
activity) and physical properties (e.g., molecular 
weight) of naturally-occurring mammalian OB receptor 

15 including allelic variants thereof. The term "purified 
and isolated" herein means substantially free of 
unwanted substances so that the present polypeptides are 
useful for an intended purpose. For example, one may 
have a recombinant human OB receptor substantially free 

20 of human proteins or pathological agents. These 

polypeptides are also characterized by being a product 
of mammalian cells, or the product of chemical synthetic 
procedures or of procaryotic or eucaryotic host 
expression (e.g., by bacterial, yeast, higher plant, 

25 insect and mammalian cells in culture) of exogenous DNA 
sequences obtained by genomic or cDNA cloning or by gene 
synthesis. The products of expression in typical yeast 
(e.g., Saccharomvces cerevisiae ) . insect, or procaryote 
(e.g., £. coli) host cells are free of association with 

30 any mammalian proteins. The products of expression in 
vertebrate (e.g., non-human mammalian (e.g. COS or CHO) 
and avian) cells are free of association with any human 
proteins. Depending upon the host employed, and other 
factors, polypeptides of the invention may be 

35 glycosylated with mammalian or other eucaryotic 

carbohydrates or may be non-glycosylated. One may modify 
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the nucleic acid so that glycosylation sites are 
included in the resultant polypeptide. One may choose to 
partially or fully deglycosylate a glycosylated 
polypeptide! Polypeptides of the invention may also 
5 include an initial methionine amino acid residue (at 

position -1 with respect to the first amino acid residue 
of the mature polypeptide) • 

In addition to naturally-occurring allelic 
forms of OB receptor, the present invention also 
10 embraces other OB receptor products such as polypeptide 
analogs of OB receptor and fragments of OB receptor. 
Following the procedures of the above noted published 
application by Alton et al . (WO 83/04053), one can 
readily design and manufacture genes coding for 
15 microbial expression of polypeptides having primary 

conformations which differ from that herein specified 
for in terms of the identity or location of one or more 
residues (e.g., substitutions, terminal and intermediate 
additions and deletions) . Alternately, modifications of 
20 cDNA and genomic genes may be readily accomplished by 
well-known site-directed mutagenesis techniques and 
employed to generate analogs and derivatives of OB 
receptor. Such products would share at least one of the 
biological properties of mammalian OB receptor but may 
25 differ in others. As examples., projected products of 
the invention include those which are foreshortened by 
e.g., deletions; or those which are more stable to 
hydrolysis (and, therefore, may have more pronounced or 
longer lasting effects than naturally-occurring) ; or 
30 which have been altered to delete one or more potential 
sites for glycosylation (which may result in higher 
activities for yeast-produced products) ; or which have 
one or more cysteine residues deleted or replaced by, 
e.g., alanine or serine residues and are potentially 
35 more easily isolated in active form from microbial 

systems; or which have one or more tyrosine residues 
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replaced by phenylalanine; or have an altered lysine 
composition (such as those prepared for purposes of 
derivatization) . Included are those polypeptides with 
amino acid substitutions which are "conservative" 
5 according to acidity, charge , hydrophobicity, polarity, 
size or any other characteristic known to those skilled 
in the art- See generally . Creighton, Proteins, W,H. 
Freeman and Company, N.Y., (1984) 498 pp, plus index, 
passim. One may make changes in selected amino acids so 

10 long as such changes preserve the overall folding or 
activity of the protein, (see Table 1, below) . Small 
amino terminal extensions, such as an amino-terminal 
methionine residue, a small linker peptide of up to 
about 20-25 residues, or a small extension that facili- 

15 tates purification, such as a poly-histidine tract, an 
antigenic epitope or a binding domain, may also be 
present . in general Ford et al . , Protein Expression 

and Purification 2.: 95-107, 1991, which is herein 
incorporated by reference. 
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Table 1 

^onfifirvative Am ino Acid Substitutions 



Basic: 


arginine 

lysine 
histidine 


Acidic: 


glutamic acid 
aspartic acid 


Polar: 


glutamine 
asparacrine 


Hydrophobic: 


leucine 
isoleucine 
valine 


Aromatic: 


phenylalanine 
tryptophan 
tyrosine 


Small: 


glycine 
alanine 
serine 
threonine 
methionine 



5 Also comprehended are polypeptide fragments 

duplicating only a part of the continuous amino acid 
sequence or secondary conformations within OB receptor, 
which fragments may possess one activity of mammalian 
(particularly human) OB receptor (e.g., immunological 
10 activity) and not others (e.g., OB protein binding 
activity) . 

Of applicability to OB receptor fragments and 
polypeptide analogs of the invention are reports of the 
immunological activity of synthetic peptides which 
15 substantially duplicate the amino acid sequence extant 
in naturally-occurring proteins, glycoproteins and 
nucleoproteins. More specifically, relatively low 
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molecular weight polypeptides have been shown to 
participate in immune reactions which are similar in 
duration and extent to the immune reactions of 
physiologically significant proteins such as viral 
5 antigens, polypeptide hormones-, and the like* Included 
among the immune reactions of such polypeptides is the 
provocation of the formation of specific antibodies in 
immunologically active animals. See , e.g . . Lerner et 
al., Cell 22'- 309-310 (1891); Ross et al., Nature 294 : 

10 654-656 (1891); Walter et al., PNAS-USA 77 : 5197-5200 
(1980); Lerner et al . , PNAS-USA, 2&: 3403-3407 (1891); 
Walter et al . , PNAS-USA 23.: 4882-4886 (1891); Wong et 
al., PNAS-USA 79: 5322-5326 (1982); Baron et al . , Cell 
28 : 395-404 (1982); Dressman et al., Nature 295 : 185-160 

15 (1982); and Lerner, Scientific American 248 : 66-74 

(1983) . See , also , Kaiser et al . Science 222* 249-255 

(1984) relating to biological and immunological 
activities of synthetic peptides which approximately 
share secondary structures of peptide hormones but may 

20 not share their primary structural conformation. The 

present invention also includes that class of polypep- 
tides coded for by portions of the DNA complementary to 
the protein-coding strand of the human cDNA or genomic 
DNA sequences of OB receptor i.e., "complementary 

25 inverted proteins" as described by Tramontano et al. 

Nucleic Acid Res. 12.: 5049-5059 (1984). Polypeptides or 
. .analogs thereof may also contain one or more amino acid 
analogs, such as peptidomimetics . 

Thus, the present class of OB receptor 

30 proteins includes those having amino acids (according to 
Seq. ID No. .1) : 

(a) 1-896; 

(b) 22-896; 

(c) 23-896; 
35 (d) 29-896 

(e) 1-839; 
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(f) 22-839; 

(g) 29-839; 

(h) 1-841; 

(i) 22-841; 
5 (j) 23-841; 

(k) 29-841; 
(1) 1-891; 
(m) 22-891; 
(n) 23-891; 

10 id) 29-891; 

(p) the amino acids of subparts (1) 
through (o)' having the C-terminal amino acid sequence 
beginning at position 892 of OB receptor B (Seq. ID 
No. 3) or C (Seq. ID. No. 5) ; 
15 (q) amino acids of subparts b, c, d, f, 

g, i, j, k/ m, n, o, and any of (p) lacking a leader 
sequence, which have an N-terminal methionyl residue. 

Also provided is a longer form of an OB 
receptor protein, herein denominated the "D" form, which 
20 has an amino acid sequence selected from among 
(according to Seq. ID No. 7) : 

(a) amino acids 1-1165; 

(b) amino acids 22-1165; 

(c) amino acids 23-1165; 
25 <d) amino acids 29-1165; 

(e) amino acids of subparts (b) , (c) or 
(d) having an N-terminal methionyl residue. 

As set forth above, one may prepare soluble 
receptor by elimination of the transmembrane and intra- 
30 cellular regions. Examples of soluble receptors include 
those set forth in Seq. ID Nos . 10 and 13. What is 
thought to be a native, secreted form of a soluble human 
OB receptor is also provided herein. This form of OB 
receptor protein has an amino acid sequence selected 
35 from among (according to Seq. ID No. 13) : 

(a) amino acids. 1-804; 
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(b) amino acids 22-804; 

(c) amino acids 23-804; 

(d) amino acids 29-804; and, 

(e) amino acids of subparts (b) , (c) or 
5 (d) having an N-terminal methionyl residue. 

In addition, since the C-terminus region of 
the above polyeptides diverges at position 892 (with 
respect to Seq. ID Nos. 1, 3, 5, 7 and 13) one may 
desire to prepare only the polypeptides which are 
10 divergent: 

(a) those having only amino acids 892-896 

of Seq. ID No. 1; 

(b) those having only amino acids 892-904 

of Seq. ID No. 3; 
15 ( C ) those having only amino acids 8 92-958 

of Seq. ID No. 5; 

(d) those having only amino acids 892- 

1165 of Seq. ID No. "7; and, 

(e) those having only amino acids 799-804 

20 Of Seq. ID No. 13- 

The above polypeptides which have an 
extracellular domain may be modified, as indicated 
above, and still retain the function of ligand binding. 
Such modification may include one or more of the 

25 following: 

(a) the random coil domain (as indicated 
above, occuring downstream of the second hematopoietic 
domain through the beginning of the transmembrane 
domain) may be deleted (this may be approximately 

30 positions 642 through 839 or 841); 

(b) the "WSXWS" box may be modified by 
(i) substitution of the first serine with another amino 
acid, particularly conserved in terms of hydrophobicity 
and/or charge, such as a glycine; (ii) the last serine 

35 may be substituted with another amino acid, such as a 
threonine; (iii) the first tryptophan may be 
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substituted with another amino acid, for example, a 
tyrosine. 



Thus, the present polypeptides include 
(according to the numbering of Seq. ID No. 7) : 



5 


(a) 


1-896; 




(b) 


22-896; 




<c) 


23-8 96; 




(d) 


29-896 




(e) 


1-839; 


10 


<f) 


22-839; 




<g) 


29-839; 




<h) 


1-841; 




(i) 


22-841; 




(j) 


23-841; 


15 


(k) 


29-841; 




(1) 


1-891; 




(m) 


22-891; 




(n) 


23-891; 




(o) 


29-891; 


20 


(P) 


the amino acids of subparts (1) 



through (o) having the C-terminal amino acids selected 
from the C-terminal amino acids of OB receptor B (Seq. 
ID No. 3), C (Seq. ID. No. 5) and D (Seq ID No. 7) ; 

(q) the amino acids (according to Seq. ID 

25 No. 13) selected from the group consisting of 22-804; 
23-804 and 29-804; 

(r) amino acids of subparts b, c, d, f, 
g, i, j, k, m, n, o, any of (p) lacking a leader 
sequence, and (q) which have an N-terminal methionyl 

30 residue; and 

(s) amino acids of subparts (a) through 
(r) .which above having at least one of the following 
modifications: 

(i) for amino acids of subparts (a) 
35 through (p) and those of subpart (r) which are not amino 
acids according to subpart (q) , deletion of (or 
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substitution of amino acid(s) or other modifications of) 
a random coil domain sequence selected from 

(a) 640 through 839 (using 
the numbering according to Seq. ID No. 1) ; 

5 (b) 641 through 839; 

(c) 642 through 839; 

(d) 64 0 through 841; 

(e) 641 through 841; and 

(f) 642 through 841; 

10 (ii) for amino acids of subpart (q) 

and those of subpart (r) which contain the sequence of 
subpart (q) , deletion of of (or substitution of amino 
acid(s) or other modifications of) a random coil domain 
sequence selected from among: 

15 (a) 640 through 804; 

(b) 641 through 804; and, 

(c) 642 through 804; 

and, 

(iii) modification of a "WSXWS" 

20 sequence which is 

(a) substitution of the first 
serine with another amino acid, particularly conserved 
in terms of hydrophobicity and/or charge, such as a 
glycine; 

25 (b) substition of the last 

serine with another amino acid, such as a threonine; 
and 

(c) substitution of the first 
tryptophan with another amino acid, for example, a 
30 tyrosine. 

One may modify the OB receptor to create a 
fusion molecule with other peptide sequence. For 
example, if one desired to "tag" the OB receptor with an 
immunogenic peptide, one could construct a DNA which 
35 would result in such fusion protein. The tag may be at 
the N-terminus. Also, since it is apparent that 'the 
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C-terminus is not necessary for ligand binding activity, 
one may chemically modify the C-terminus of, for 
example, a soluble OB receptor. One may desire, for 
example, a preparation whereby one or more polymer 
5 molecules such as polyethylene glycol molecules are 

attached. Thus, another aspect of the present invention 
is chemically modified OB receptor protein (also further 
described infra) . 

An example of such "tag" is provided herein 

10 using the C-terminus of a recombinant soluble OB 

receptor. Seq. ID No. 12 provides a "FLAG-tag" version 
of such soluble OB receptor (the nucleic acid sequence 
is provided, which may be transcribed to prepare the 
polypeptide) . Such "FLAG-tag" may also be attached to 

15 the N-terminus or other region of an OB receptor 

protein. This type of "tagging" is useful to bind the 
protein using reagents, such as antibodies, which, are 
selective for such tag. Such binding may be for 
detection of the location or amount of protein, or for 

20 * protein capturing processes where, for example, an 

affinity column is used to bind the tag, and thus the 
desired protein. Other types of detectable labels, such 
as radioisotopes, light-emitting (e.g., fluorescent or 
phosporescent compounds) , enzymatically cleavable, 

25 detectable antibody (or modification thereof) , or other 
substances may be used for such labelling of the present 
proteins. Detecting protein via use of the labels may 
be useful for identifying the presence or amount of OB 
receptor protein or a compound containing such protein 

30 (e.g., OB protein complexed to OB receptor). Moreover, 
such labelled protein may be useful for distinguishing 
exogenous OB receptor protein from the endogenous form. 
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ni No vel nucleic acid sequences of the invention 

include sequences useful in securing expression in 
procaryotic or eucaryotic host cells of polypept.de 
products having at least a part of the primary struc- 
tural conformation and one or more of the biological 
properties of recombinant human OB receptor. The 
nucleic acids may be purified and isolated, so that the 
desired coding region is useful to produce the present 
.polypeptides, for example, or for diagnostic purposes, 
as described more fully below. DNA sequences of the 



invention specifically comprise: (a) any of the DNA 
sequences set forth in Seq. ID No. 2, 4, 6, 8 9 11. 
15 12, and 14 (and complementary strands); (b) a DNA 



sequence which hybridizes (under hybridization 
conditions disclosed in the cDNA library screening 
section below, using the 300 bp PGR fragment as 
described to selectively hybridize to a cDNA encoding an 
20 OB receptor protein in a human liver cDNA library, or 
equivalent conditions or more stringent conditions) to 
t ne DNA sequence in subpart (a) or to fragments thereof, 
and (O a DNA sequence which, but for the degeneracy of 
t he genetic code, would hybridize to the DNA sequence, 
25 subpart (a) . Specifically comprehended in parts (b) and 
(c) are genomic DNA sequences encoding allelic variant 
forms of human OB receptor and/or encoding ^ receptor, 
from other mammalian species, and manufactured DNA 
sequences encoding OB receptor, fragments of OB 
30 refeptor, and analogs of OB receptor which 

ma y incorporate codons facilitating transcription and 
translation of messenger RNA in microbial hosts. Such 
manufactured sequences may readily be constructed 
according to the methods of Alton et al., PCT published 
35 application WO 83/04053. 



WO 97/25424 



PCT/US97/00128 



- 20 - 

Genomic DNA, such as that of Seq. ID No. 9, 
encoding the present OB receptors may contain additional 
non-coding bases, or introns, and such genomic DNAs are 
obtainable by hybridizing all or part of the cDNA, 
5 illustrated in Seq. ID Nos. 2, 4, 6, 8, 11, and 14 to a 
genomic DNA source, such as a human genomic DNA library. 
Such genomic DNA will encode functional OB receptor 
polypeptide; however, use of the cDNAs may be more 
practicable in that, since only the coding region is 
10 involved, recombinant manipulation is facilitated. The 

intron/exon location of genomic DNA is set forth in Seq. 
id No. 9, infra. 

Nucleic acid sequences include the 
incorporation of codons which enhance expression by 

15 selected nonmammalian hosts; the provision of sites for 
cleavage by restriction endonuclease enzymes; and the 
provision of additional initial, terminal or 
intermediate DNA sequences which facilitate construction 
of cloning and/or expression vectors. 

20 The present invention also provides DNA 

sequences coding for polypeptide analogs or derivatives 
of OB receptor which differ from naturally-occurring 
forms in terms as described above. The leader sequence 
DNA may be substituted with another leader sequence for 

25 ease in expression or for other purposes. 

Also, one may prepare antisense nucleic acids 
against the present DNAs. Such antisense nucleic acids 
may be useful in modulating the effects of OB receptor 
protein in yjvp.. For example, one may prepare an 

30 antisense nucleic acid which effectively disables the 
ability of a cell to produce OB receptor by binding to 
the nucleic acid which encodes such OB receptor. 

DNA sequences of the invention are also 
suitable materials for use as labeled probes in 

J5 isolating human genomic DNA encoding OB receptor, as 
mentioned above, and related proteins as well as cDNA 
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and genomic DNA sequences of other mammalian *P ecies ; 
£ lequences may also be useful in various al-rnat.ve 
me thodTof protein synthesis (e.g., in insect cells) or, 
r S descried 'infx*, in genetic therapy in humans and 
5 other mammals. DNA sequences of the invention are 

expected to be useful in developing transgenic mammalxan 
species which may serve as eucaryotic "hosts" for 
production of OB receptor and OB receptor product. , xn 
Entity. See, generally, Palmiter et al., Scxence 22Z- 
10 809-814 (1983) . 

y^ t ^ and HO^t CelU. 

According to another aspect of the present 

invention, the DNA sequences ^ cribe ^Tl^the 
15 encode OB receptor polypeptides are valuable forthe 
1 Nation which they provide . concerning t,e a^n ^ 
sequence of the mammalian protein which have heretofore 
b een unavailable. Put another way, DNA sequences 
provided by the invention are useful xn generating new 
20 and useful viral and circular plasmid DNA vectors new 
and useful transformed and transfected P^^"^ 
eucaryotic host cells (including bacterial cells yeast 
cells, insect cells, and mammalian cells grown in 
culture) , and new and useful methods for ^ 
25 of such host cells capable of expression of OB receptor 

and its related products. „* iri „ 
The DNA provided herein (or corresponding 
BKAs) may also be used for gene therapy for, example, 
treatment of conditions characterized by the 
30 overexpression of OB protein, such as anorexia or 

cachexia. Alternatively, gene therapy may be used xn 
cases where increased sensitivity to OB protexn xs 
desired, such as in cases where an individual has a 
condition characterized by OB protein receptors . 
35 defective in ability to bind or retain the bxndxn, , of OB 
protein. Currently, vectors suitable for gene therapy 
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35 



(such as retroviral or adenoviral vectors modified for 
gene therapy purposes and of purity and pharmaceutical 
acceptability) may be administered for delivery into the 
lung, for example. Such vectors may incorporate nucleic 
acid encoding the present polypeptides for expression in 
a desired location. Gene therapy may involve more than 
one gene for a desired protein or different desired 
proteins . 

Alternatively, one may use no vector so as to 
facilitate relatively stable presence in the host. For 
example, homologous recombination of a DNA as provided 
herein or of a suitable transcription or translation 
control region may facilitate integration into or 
expression from a host genome. (This may be performed 
15 for production purposes as well, e.g., U.S. Patent 

No. 5,272,071 and WO 91/09955.) The nucleic acid may be 
Placed within a pharmaceutical^ acceptable carrier to 
facilitate cellular uptake, such as a lipid solution 
carrier (e.g., a charged lipid), a liposome, or 
polypeptide carrier (e.g., polylysine) . A review 
article on gene therapy is Verma, Scientific American, 
November 1990, pages 68-84 which is herein incorporated 
by reference. 

Thus, the present invention provides for a 
population of cells expressing an OB receptor of the 
present OB receptor family. Such cells are suitable for 
transplantation or implantation into an individual for 
therapeutic purposes. For example, one may prepare a 
population of cells to overexpress OB receptor (such as 
one identified in the Sequence id's or otherwise denoted 
herein) , or to express a desired form of OB receptor, 
such as one which is particularly sensitive to OB 
protein (i.e., a form which has a desired capacity for 
signal transduction) . One may then implant such cells 
into an individual*. to increase that individual's 
sensitivity to OB protein. Such cells may, for example, 



20 



25 



30 
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be liver cells, bone marrow cells, or cells derived from 
umbillical cord. Alternatively, one may wish to use 
overexpressing circulating cells such as blood 
progenitor cells, T cells or other blood cells. For 
humans, human cells may be used. Cells may be in the 
form of tissue. Such cells may be cultured prior to 
transplantation or implantation. Such OB receptor 
overexpression, or expression of particularly sensitive 
forms of OB receptor may be accomplished by, for 
example, altering the regulatory mechanism for 
expression of OB receptor, such as using homologous 
recombination techniques as described suet*. Thus, 
provided is a population of host cells modified so that 
expression of endogenous OB receptor DHA is enhanced. 
5 The cells to be transferred to the recipient 

may be cultured using one or more factors affecting the 
growth or proliferation of such cells if appropriate. 
Hematopoietic factors may be used in culturing 
hematopoietic cells. Such factors include G-CSF, EPO, 
20 MGDF, SCF, Flt-3 ligand, interleukins (e.g., IL1-IL13) , 
GM-CSF, LIF, and analogs and derivatives thereof as 
available to one skilled in the art. 

Nerve cells, such as neurons or glia, may also 
be used, and these may be cultured with neurotrophic 
25 factors such as BDNF, CNTF, GDNF, NT3, or others. 

There may be a co-gene therapy involving the 
transplantation of cells expressing more than one 
desired protein. For example, cells expressing OB 
receptor protein may be used in conjunction, 
30 simultaneously or ia ^rrjatim with cells expressing OB 
protein. 

For gene therapy dosages, one will generally 
use between one copy and several thousand copies of the 
present nucleic acid per cell, depending on the vector, 
35 the expression system, the age, weight and condition of 
the recipient and other factors which will be apparent 
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to those skilled in the art. The cellular delivery of 
such protein may be designed to last for a selected 
period of time, such as a period of days, weeks, months 
or years. At the end of the effective time period, the 
5 recipient of such transformed cells may receive another 
-dose" (e.g., transplantation of cells). Cells may be 
selected for their lifespan, their time period of 
expression of the desired protein, or their ability to 
be reisolated from an individual (i.e., for blood cells, 
10 leukaphoresis may be used to retrieve transformed cells 
using markers present on the cell surface) . Vectors may 
be similiarly designed using, for example, viruses which 
have a known period of expression of DNAs contained 
therein. 

15 The desired cells or vectors may be stored 

using techniques, such as freezing, available to those 
in the art. 

Thus, the present invention also contemplates 
a method for administering OB receptor protein to an 

20 individual, wherein the source of said OB receptor 
protein is selected from (i) a population of cells 
expressing OB receptor protein and (ii) a population of 
vectors expressing OB receptor protein. Said OB 
receptor protein may be selected from among those 

25 described herein. Said vectors may be virus vectors 
capable of infecting human cells. Said cells may be 
selected from among tissue or individual cells. Said 
individual cells may be selected from among adipocytes, 
fibroblasts, bone marrow cells, peripheral blood 

30 progenitor cells, red blood cells, and white blood 
. cells, including T cells and nerve cells. Said 
population of cells or vectors may be co-administered 
with a population of cells or vectors which express OB 
protein or another desired protein. Said cells or 

35 vectors may be stored for use in an individual. Storage 
may be by freezing 
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Complexes 

In addition to the OB receptor protein as 
described hetein, one may prepare complexes of OB 
5 receptor protein and OB protein, analog or derivative. 

The OB protein may be selected from those 
described in PCT publication WO 96/05309, above and 
hereby incorporated by reference in its entirety. 
Figure 3 of that publication (Seq. ID No. 4, as cited 

10 therein) depicts the full deduced amino acid sequence 
derived for the human OB gene. The amino acids are 
numbered from 1 to 167. A signal sequence cleavage site 
is located after amino acid 21 (Ala) so that the mature 
protein extends from amino acid 22 (Val) to amino acid 

15 167 (Cys) . For the present disclosure, a different 

numbering is used herein, where the amino acid position 
1 is the Valine residue which is at the beginning of the 
mature protein. 

Generally, the OB protein for use will be 

20 capable of complexing to the OB protein receptor 

selected. Thus, one may empirically test the binding 
capability (to all or part of the extracellular domain 
of the OB receptor as indicated above) to determine 
which OB protein forms may be used. Generally, 

25 modifications generally applicable as indicated above 
for OB receptor protein may also be applied here, and 
that disclosure is incorporated by reference here. As 
set forth in WO 96 05309, OB protein in its native form, 
or fragments (such as enzyme cleavage products) or other 

30 truncated forms, analogs, and derivatives all retain 

biological activity. Such forms may be used so long as 
the form binds to at least a portion of the 
extracellular domain of the present OB receptor 
proteins . 

35 An effective amount of an OB protein, analog 

or derivative thereof may be selected from among 
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according to the amino acid sequence as presented in PCT 
WO 96/05309, Figure 3 numbered so. that the first amino 
acid of the mature protein is number 1: 

(a) the amino acid sequence 1-14 6, 

5 optionally lacking a glutaminyl residue at position 28, 
and further optionally having a methionyl residue at the 
N-terminus; 

(b) an amino acid sequence of subpart 
(a) having a different amino acid substituted in one or 

10 more of the following positions: 4, 8, 32, 33, 35, 48, 
50, 53, 60, 64, 66, 67, 68, 71, 74, 77, 78, 89, 97, 
100, 102, 105, 106, 107, 108, 111, 112, 118, 136, 138, 
142, and 145; 

(c) a truncated OB protein analog 

15. selected from among: (using the numbering of subpart (a) 
above) : 

(i) amino acids 98-14 6 
, (ii) amino acids .1-32 

(iii) amino acids 1-35 
20 (iv) amino acids 40-116 

(v) amino acids 1-99 and 112-146 
(vi) amino acids 1-99 and 112-14 6 
having one or more of amino acids 100-111 
sequentially placed between amino acids 99 and 112; 
25 and, 

(vii) the truncated OB analog of 
subpart (i) having one or more of amino acids 100, 
102, 105, 106, 107, 108, 111, 112, 118, 136, 138, 
142, and 145 substituted with another amino acid; 

30 (viii) the truncated analog of subpart 

<ii) having one or more of amino acids 4, 8 and 32 
substituted with another amino acid; 

(ix) the truncated analog of subpart 
(iv) having one or more of amino acids 50, 53, 60, 

35 64, 66, 67, 68, 71, 74, 77, 78, 89, 97, 100, 102, 
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105, 106, 107, 108, 111 and 112 replaced with 
another amino acid; 

(x) the truncated analog of subpart 

(v) having one or more of amino acids 4, 8, 32, 33, 
5 35, 48, 50, 53, 60, 64, 66, 67, 68, 71, 74, 77, 

78, 89, 97, 112, 118, 136, 138, 142, and 145 
replaced with smother amino acid; 

(xi) the truncated analog of subpart 

(vi) having one or more of amino acids 4, 8,32, 33, 
10 35, 48, 50, 53, 60, 64, 66, 67, 68, 71, 74, 77, 

78, 89, 97, 100, 102, 105, 106, 107, 108, 111, 
112, 118, 136, 138, 142, and 145 replaced with 
another amino acid; 

(xii) the truncated analog of any of 
15 subparts (i)-(xi) having an N-terminal methionyl 

residue; and 

(d) the OB protein or analog derivative 
of any of subparts (a) through (c) comprised of a 
chemical moiety connected to the protein moiety; 
20 (e) a derivative of subpart (d) wherein 

said chemical moiety is a water soluble polymer moiety; 

(f) a derivative of subpart (e) wherein 
said water soluble polymer moiety is polyethylene 
glycol; 

25 (g) A derivative of subpart (f) wherein 

said water soluble polymer moiety is a polyamino acid 
moiety; 

(h) a derivative of subpart (g) wherein 
said water soluble polymer moiety is attached at solely 

30 the N-terminus of said protein moiety; 

(i) an OB protein, analog or derivative 
of any of subparts (a) through (h) in a pharmaceutical^ 
acceptable carrier. • 

OB proteins, analogs and related molecules are 
35 also reported in the following publications; however, no 
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representation is made with regard to the activity of 
any composition reported: 

U.S. Patent Nos. 5,521,283; 5,532,336; 
5,552,522; 5,552,523; 5,552,524; 5,554,727; 
5 5,559,208; 5,563,243; 5,563,244; 5,563,245; 

5,567, 678; 5,567,803; 5,569,744; 5,569,743 

(all assigned to Eli Lilly and Company) ; 
PCT W096/23517; W096/23515; W096/23514; 
WO96/24670; W096/23513; W096/23516; 
10 W096/23518; W096/23519; WO96/23520; 

W096/23815; WO96/24670; W096/27385 (all 

assigned to Eli Lilly and Company) ; 
PCT WO96/22308 (assigned to Zymogenetics) ; 
PCT WO96/29405 (assigned to Ligand 
15 Pharmaceuticals, Inc.); 

PCT W096/31526 (assigned to Amyin 

Pharmaceuticals , Inc . ) ; 
PCT W096/34885 (assigned to Smithkline Beecham 
PLC) ; 

20 PCT W096/35787 (assigned to Chiron) ; 

EP 0 725 079 (assigned to Eli Lilly and 
Company) ; 

EP 0 725 078 (assigned to Eli Lilly and 
Company) ; 

25 EP 0 736 599 (assigned to Takeda) ; 

EP 0 741 187 (assigned to F. Hoffman LaRoche) . 

To the extent these references provide for 
useful OB proteins or analogs or derivatives thereof, or 

30 associated compositions or methods, such compositions 
and/or methods may be used in conjunction with the 
present OB receptor proteins, such as for co- 
administration (together or separately, in a selected 
dosage schedule) or by complexing compositions to the 

35 present OB protein receptors. With the above provisos, 
these publications are herein incorporated by reference. 



WO 97/25424 



PCT/US97/00128 



- 29 - 



10 



15 



Derivative anH PorfflnlpH^ 

The present OB protein receptor and/or OB 
protein (herein the term "protein" is used to include 
"peptide" and OB protein or receptor analogs, such as 
those recited infra,, unless otherwise indicated) may 
also be derivatized by the attachment of one or more 
chemical moieties to the protein moiety. if the present 
pharmaceutical compositions contain as the active 
ingredient a complex of OB protein receptor and OB 
protein, one or both of such proteins may be 
derivatized. The chemically modified derivatives may be 
further formulated for intraarterial, intraperitoneal, 
intramuscular, subcutaneous, intravenous, oral, nasal, 
pulmonary, topical or other routes of administration. 
Chemical modification of biologically active proteins 
has been found to provide additional advantages under 
certain circumstances, such as increasing the stability 
and circulation time of the therapeutic protein and 
20 decreasing immunogenicity . £ae. U.S. Patent 

No. 4,179, 337, Davis et al-., issued December 18, 1979. 
For a review, Abuchowski et al., in Enzymes as 

Drugs. (j.s. Holcerberg and J. Roberts, eds . 
pp. 367-383 (1891)). a review article describing 
protein modification and fusion proteins is Francis, 
Focus on Growth Factors 3_: 4-10 (May 1992) (published by 
Mediscript, Mountview Court, Friern Barnet Lane, London 
N20, OLD, UK) . 

Preferably, for therapeutic use of the 
end-product preparation, the chemical moiety for 
derivatization will be pharmaceutical^ acceptable. A . 
polymer may be used. One skilled in the art will be 
able to select the desired polymer based on such 
considerations as whether the polymer/protein conjugate 
will be used therapeutically, and if so, the desired 
dosage, circulation time, resistance to proteolysis, and 



25 



30 



35 
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other considerations. For the present proteins and 
peptides, the effectiveness of the derivatization may be 
ascertained by administering the derivative, in the 
desired form (i.e., by osmotic pump, or by injection or 
5 infusion, or, further formulated for oral, pulmonary or 
nasal delivery, for example) , and observing biological 
effects as described herein. 

The chemical moieties suitable for 
derivatization may be selected from among various water 

10 soluble polymers. The polymer selected should be water 
soluble so that the protein to which it is attached so 
that it is miscible in an aqueous environment, such as a 
physiological environment. The water soluble polymer 
may be selected from the group consisting of, for 

15 example, polyethylene glycol, copolymers of ethylene 
glycol/propylene glycol, carboxymethylcellulose, 
dextran, polyvinyl alcohol, polyvinyl pyrolidone, 
poly-1, 3-dioxolane, poly-1, 3, 6-trioxane, 
ethylene/maleic anhydride copolymer, polyaminoacids 

20 (either homopolymers or random or non-random copolymers 
( see supra regarding fusion molecules) , and dextran or 
poly(n-vinyl pyrolidone) polyethylene glycol, propylene 
glycol homopolymers, polypropylene oxide/ethylene oxide 
co-polymers, polyoxyethylated polyols, 

25 polystyrenemaleate and polyvinyl alcohol. Polyethylene 
glycol propionaldenhyde may have advantages in 
manufacturing due to its stability in water. 

Fusion proteins may be prepared by attaching 
polyaminoacids to the OB protein receptor or OB protein 

30 (or analog or complex) moiety. For example, the 

polyamino acid may be a carrier protein which serves to 
increase the circulation half life of the protein. For 
the present therapeutic or cosmetic purposes, such 
polyamino acid should be those which do not create 

35 neutralizing antigenic response, or other adverse 

response. Such polyamino acid may be selected from the 
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group consisting of serum album (such as human serum 
albumin) , an antibody or portion thereof (such as an 
antibody constant region, sometimes called "F c w ) or 
other polyamino acids. As indicated below, the location 
5 of attachment of the polyamino acid may be at the 

N-terminus of the OB protein moiety, or other place, and 
also may be connected by a "chemical "linker" moiety to 
the OB protein. 

The polymer may be of any molecular weight, 

10 and may be branched or unbranched. For polyethylene 

glycol, the preferred molecular weight is between about 
2 kDa and about 100 kDa (the term "about" indicating 
that in preparations of polyethylene glycol, some 
molecules will weigh more, some less, than the stated 

15 molecular weight) for ease in handling and 

manufacturing. Other sizes may be used, depending on 
the desired therapeutic profile (e.g., the duration of 
sustained release desired, the effects, if any on 
biological activity, the ease in handling, the degree or 

20 lack-of antigenicity and other known effects of the 

polyethylene glycol to a therapeutic protein or analog) . 

The number of polymer molecules so attached 
may vary, and one skilled in the art will be able to 
ascertain the effect on function. One may mono- 

25 derivatize, or may provide for a di-, tri-, tetra- or 
some combination of derivatization, with the same or 
different chemical moieties (e.g., polymers, such as 
different weights of polyethylene glycols) . The 
proportion of polymer molecules to protein (or peptide) 

30 molecules will vary, as will their concentrations in the 
reaction mixture. In general, the optimum ratio (in 
terms of efficiency of reaction in that there is no 
excess unreacted protein or polymer) will be determined 
by factors such as the desired degree of derivatization 

35 (e.g., mono, di-, tri-, etc.), the molecular weight of 
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the polymer selected, whether the polymer is branched or 
unbranched, and the reaction conditions. 

The chemical moieties should be attached to 
the protein with consideration of effects on functional 
5 or antigenic domains of the protein. There are a number 
of attachment methods available to those skilled in the 
art, E.g. . EP 0 401 384 herein incorporated by reference 
(coupling PEG to G-CSF) , see also Malik et al., Exp. 
Hematol. 2£: 1028-1035 (1992) (reporting pegylation of 

10 GM-CSF using tresyl chloride) . For example, 

polyethylene glycol may be covalently bound through 
amino acid residues via a reactive group, such as, a 
free amino or carboxyl group. Reactive groups are those 
to which an activated polyethylene glycol molecule (or 

15 other chemical moiety) may be bound. The amino acid 
residues having a free amino group may include lysine 
residues and the N-terminal amino acid residue. Those 
having a free carboxyl group may include aspartic acid 
residues, glutamic acid residues, and the C-terminal 

20 amino acid residue. Sulfhydrl groups may also be used 
as a reactive group for attaching the polyethylene 
glycol molecule (s) (or other chemical moiety). 
Preferred for therapeutic manufacturing purposes is 
attachment at an amino group, such as attachment at the 

25 N-terminus or lysine group. Attachment at residues 
important for receptor binding should be avoided if 
receptor binding is desired. 

One may specifically desire N-terminally 
chemically modified protein. Using polyethylene glycol 

30 as an illustration of the present compositions, one may 
select from a variety of polyethylene glycol molecules 
(by molecular weight, branching, etc.), the proportion 
of polyethylene glycol molecules to protein molecules in 
the reaction mix, the type of pegylation reaction to be 

35 performed, and the method of obtaining the selected 

N-terminally pegylated protein. The method of obtaining 
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15 



the N-terminally pegylated preparation (i.e., 
separating this moiety from other monopegylated moieties 
if necessary) may be by purification of the N-terminally 
pegylated material from a population of pegylated 
5 protein molecules. Selective N-terminal chemical 

modification may be accomplished by reductive alkylation 
which exploits differential reactivity of different 
types of primary amino groups (lysine versus the 
N-terminal) available for derivatization in a particular 
10 protein. ^ PCT WO 96/11953, herein incorporated by 
reference. Under the appropriate reaction conditions, 
substantially selective derivatization of the protein' at 
the N-terminus with a carbonyl group containing polymer 
is achieved. For example, one may selectively 
N-terminally pegylate the protein by performing the 
reaction at a pH which allows one to take advantage of 
the P K a differences between the e-amino group of the 
lysine residues and that of the a-amino group of the 
N-terminal residue of the protein. By such selective 
derivatization, attachment of a polymer to a protein is 
controlled: the conjugation with the polymer takes 
Place predominantly at the N-terminus of the protein and 
no significant modification of other reactive groups, 
such as the lysine side chain amino groups, occurs. 
Using reductive alkylation, the polymer may be of the 
type described above, and should have a single reactive 
aldehyde for coupling to the protein. Polyethylene 
glycol propionaldehyde, containing a single reactive 
aldehyde, may be used. 

An N-terminally chemically modified derivative 
is preferred (over other forms of chemical modification) 
for ease in production of a therapeutic. N-terminal 
chemical modification ensures a homogenous product as 
characterization of the product is simplified relative 
35 to di-, tri- or other multi-derivatized products. The 
use of the above reductive alkylation process for 
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preparation of an N-terminally chemically modified 
product is preferred for ease in commercial 
manu f actur ing . 

In yet another aspect of the present 
5 invention, provided are methods of using pharmaceutical 
compositions of the proteins, and derivatives. Such 
pharmaceutical compositions may be for administration by 
injection, or for oral, pulmonary, nasal, transdermal or 
other forms of administration. In general, comprehended 
10 by the invention are pharmaceutical compositions 

comprising effective amounts of protein or derivative 
products of the invention together with pharmaceutical^ 
acceptable diluents, preservatives, solubilizers, 
emulsifiers, adjuvants and/or carriers- Such 
15 compositions include diluents of various buffer content 
(e.g., Tris-HCl, acetate, phosphate), pH and ionic 
strength; additives such as detergents and solubilizing 
agents (e.g., Tween 80, Polysorbate 80), anti-oxidants 
(e.g., ascorbic acid, sodium metabisulf ite) , 
20 preservatives (e.g. , Thimersol, benzyl alcohol) and 
bulking substances (e.g., lactose, mannitol) ; 
incorporation of the material into particulate 
preparations of polymeric compounds such as polylactic 
acid, polyglycolic acid, etc. or into liposomes. SSS./ 
25 e.g. . PCT W096/29989, Collins et al., "Stable protein: 
phospholipid compositions and methods," published 
October 3, 1996, herein incorporated by reference. 
Hylauronic acid may also be used, and this may have the 
effect of promoting sustained duration in the 
30 circulation. Such compositions may influence the 

physical state, stability, rate of in vjvg release, and 
rate of in vivo clearance of the present proteins and 
derivatives. See , e.g .. Remington's Pharmaceutical 
Sciences, 18th Ed. (1990, Mack Publishing Co.., Easton, 
35 PA 18042) pages 1435-1712 which are herein incorporated 
by reference. The compositions may be prepared in 
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35 



liquid form, or may be in dried powder, such as 
lyophilized form. Implantable sustained release 
formulations are also contemplated, as are transdermal 
formulations. 

5 Specifically contemplated are oral dosage 

forms of the above derivatized proteins. Protein may be 
chemically modified so that oral delivery of the 
derivative is efficacious. Generally, the chemical 
modification contemplated is the attachment of at least 
10 one moiety to the protein (or peptide) molecule itself 
where said moiety permits (a) inhibition of proteolysis- 
and (b) uptake into the blood stream from the stomach or 
intestine. Also desired is the increase in overall 
stability of the protein and increase in circulation 
time an the body. ^ PCT W095/21629, Habberfield, 
"Oral Delivery of Chemically Modified Proteins" 
(published August 17, 1995) herein incorporated by 
reference, and U.S. Patent No. 5,574,018, Habberfield et 
al., "Conjugates of Vitamin B12 and Proteins," issued 
November 12, 1996, herein incorporated by reference. 

Also contemplated herein is pulmonary delivery 
of the present protein, or derivative thereof. The 
protein (derivative) is delivered to the lungs of a 
mammal while inhaling and traverses across the lung 
epithelial lining to the blood stream. See, PCT 
WO94/20069, Niven et al., -Pulmonary administration of 
granulocyte colony stimulating factor, " published 
September 15,. 1994, herein incorporated by reference. 

Nasal delivery of the protein (or analog or 
derivative) i s also contemplated. Nasal delivery allows - 
the passage of the protein to' the blood stream directly 
after administering the therapeutic product to the nose, 
without the necessity for deposition of the product in 
the lung. Formulations for nasal delivery include those 
with absorption enhancing agents, such as dextran or 
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cyclodextran. Delivery via transport, across other 
mucous membranes is also contemplated. 



5 One skilled in the art will be able to 

ascertain effective dosages by administration and 
observing the desired therapeutic effect. Preferably, 
the formulation of the molecule or complex in a 
pharmaceutical composition will be such that between 
10 about .10 (xg/kg/day and 10 mg/kg/day will yield the 

desired therapeutic effect . The effective dosages may 
be determined using diagnostic tools over time. For 
example, a diagnostic for measuring the amount of OB 
protein or OB receptor protein in the blood (or plasma 
15 or serum) may first be used to determine endogenous 
levels of OB protein (or receptor) . Such diagnostic 
tool may be in the form of an antibody assay, such as an 
antibody sandwich assay. The amount of endogenous OB 
receptor protein (such as soluble receptor) is 
20 quantified initially, and a baseline is determined. The 
therapeutic dosages are determined as the quantification 
of endogenous and exogenous OB receptor protein (that 
is, protein, analog or derivative found within the body, 
either self-produced or administered) is continued over 
25 the course of therapy. The dosages may therefore vary 
over the course of therapy, with a relatively high 
dosage being used initially, until therapeutic benefit 
is seen, and lower dosages used to maintain the 
therapeutic benefits. 
30 During an initial course of therapy of an 

obese person, dosages may be administered whereby weight 
loss and concomitant fat tissue decrease increase is 
achieved. Once sufficient weight loss is achieved, a 
dosage sufficient to prevent re-gaining weight, yet 
35 sufficient to maintain desired weight or fat mass may be 
administered. These dosages can be determined 
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empirically, as the effects of OB protein are 
reversible. E,g, , Campfield et al., Science 2£2: 54 6-549 
(1995) at 547. Thus, if a dosage resulting in weight 
loss is observed when weight loss is not desired, one 
5 would administer a lower dose,, yet maintain the desired 
weight. 

Therapeutic Compositions and Mef.hnris 

The present OB receptor proteins, alone, or in 
10 combination with an OB protein, and nucleic acids may be 
used for methods of treatment, or for methods of 
manufacturing medicaments for treatment. Such treatment 
includes conditions characterized by excessive 
production of OB protein, wherein the present OB 

15 receptors, particularly in soluble form, may be used to 
complex to and therefore inactivate such excessive OB 
protein. Or, such OB receptor protein, particularly in 
soluble form, may act to protect the activity of OB 
protein. While not wishing to be bound by theory, one 

20 may postulate that OB protein receptor agonist activity 
may be accomplished by a protective effect achieved when 
OB protein receptor (particularly soluble receptor) is 
complexed to OB protein. Such effect may prolong the 
serum half life of OB protein in vivo . Such treatments 

25 may be accomplished by preparing soluble receptor (e.g., 
use of an extracellular domain as described suora ) and 
administering such composition to an individual in need 
thereof or by preparation of a population of cells 
containing or expressing such OB receptor, and 

30 transplanting such cells into the individual in need 
thereof. 

The present OB receptors may also be used for 
treatment of those having defective OB receptors. For 
example, one may treat an individual having defective OB 
35 receptors by preparation of a population of cells 
containing such non-defective OB receptor, and 
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transplanting such cells into an individual. Or, an 
individual may have an inadequate number of OB 
receptors, and cells containing such receptors may be 
transplanted in order to increase the number of OB 
5 receptors available to an individual. 

The present OB receptor proteins and related 
compositions such as OB receptor protein/OB protein 
complex, provide for weight loss, fat loss, increase in 
lean mass, increase in insulin sensitivity, increase in 

10 overall strength, increase in red blood cells (and 

oxygenation in the blood) , decrease in bone resportion 
or osteoporosis, decreased or maintained serum 
cholesterol level, decreased or maintained triglyceride 
(LDL or VLDL) levels, prevention or reduction in 

15 arterial plaque formation, treatment of hypertension, 

and prevention or reduction of gall stone formation. As 
body fat composition may be correlated with certain 
types of cancers, the present compositions may be useful 
for the prevention or amelioration of certain types of 

20 cancers. The present invention also includes methods 
for manufacture of a medicament for use in conjunction 
with the cosmetic/therapeutic conditions described 
herein, containing at least one of the present 
compositions. 

25 The present compositions and methods may be 

used in conjunction with other medicaments, such as 
those useful for the treatment of diabetes (e.g., 
insulin or analogs thereof, thiazolidinediones or other 
antihyperglycemic agents, and possibly amylin or 

30 antagonists there of) , cholesterol and blood pressure 
lowering medicaments (such as those which reduce blood 
lipid levels or other cardiovascular medicaments) , and 
activity increasing medicaments (e.g., amphetamines) . 
Appetite suppressants may also be used (such as 

35 serotonin modulators and neuropeptide Y antagonists) . 
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Such administration may be simultaneous or may be in 
seriatim » 

In addition, the present methods may be used 
in conjunction with surgical procedures, such as 
5 cosmetic surgeries designed to alter the overall 
appearance of a body (e.g., liposuction or laser 
surgeries designed to reduce body mass, or implant 
surgeries designed to increase the appearance of body 
mass) . The health benefits of cardiac surgeries, such 

10 as bypass surgeries or other surgeries designed to 

relieve a deleterious condition caused by blockage of 
blood vessels by fatty deposits, such as arterial 
plaque, may be increased with concomitant use of the 
present compositions and methods. Methods to eliminate 

15 gall stones, such as ultrasonic or laser methods, may 
also be used either prior to, during or after a course 
of the present therapeutic methods. Furthermore, the 
present methods may be used as an adjunct to surgeries 
or therapies for broken bones, damaged muscle, or other 

20 therapies which would be improved by an increase in lean 
tissue mass . 

In yet another aspect, the present invention 
provides for methods of manufacture of a medicament for 
the treatment of obesity, type II diabetes, excess blood 

25 lipid, or cholesterol levels, increasing sensitivity to 
insulin, increasing . lean mass, and other conditions as 
set forth above. Also provided are solely cosmetic 
treatments for individuals wishing to improve appearance 
by weight loss, and more specifically, loss of fat 

30 deposits, even in the absence of any therapeutic 
benefit. 

Diagnostic Compositions and Methods 

As indicated supra . polypeptide products of 
35 the invention may be "labeled" by association with a 
detectable marker substance (e.g., radiolabeled with 
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125j / fluorescent, chemi luminescent, enzyme) to provide 
reagents useful in detection and quantification of OB 
receptor (or complexes) in solid tissue and fluid 
samples such as blood or urine. Nucleic acid products 
5 of the invention may also be labeled with detectable 
markers (such as radiolabels and non-isotopic labels 
such as biotin) and employed in hybridization processes 
to locate the human OB receptor gene position and/or the 
position of any related gene family in a chromosomal 

10 map. Nucleic acid sequences which selectively bind the 
human OB receptor gene are useful for this purpose. 
They may also be used for identifying human OB receptor 
gene disorders at the DNA level and used as gene markers 
for identifying neighboring genes and their disorders. 

15 Such nucleic acid sequences may be sued for detection or 
measurement of OB receptor mRNA level from a biological 
sample. Contemplated herein are kits containing such 
labelled materials. . 

The protein and/or nucleic acids provided 

20 herein may also be embodied as part of a kit or article 
of manufacture. Contemplated is an article of 
manufacture comprising a packaging material and one or 
more preparations of the presently provided 
compositions. Such packaging material will comprise a 

25 label indicating that the protein or nucleic acid 

preparation is useful for detecting and/or quantifying 
the amount of OB receptor in a biological sample, or OB 
receptor defects in a biological sample. As such, the 
kit may optionally include materials to carry out such 

30 testing, such as reagents useful for performing DNA or 
RNA hybridization analysis, or PCR analysis on blood, 
urine, or tissue samples. 

A further embodiment of the invention is 
selective binding molecules, such as monoclonal 

35 antibodies selectively binding OB receptor. The 
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hybridoma technique described originally by Kohler and 
Milstein Eur, J. Immunol. £, 511-519 (1976) has been 
widely applied to produce hybrid cell lines that secrete 
high levels of monoclonal antibodies against many 
5 specific antigens. Recombinant antibodies, ( see Huse et 
al., Science 246 : 1275 (1989)) may also be prepared. 
Such recombinant antibodies may be further modified, 
such as by modification of complementarity determining 
regions to increase or alter affinity, or "humanizing" 

10 such antibodies. Such antibodies may be incorporated 
into a kit for diagnostic purposes, for example. A 
diagnostic kit may be employed to determine the location 
and/or amount or OB receptor of an individual. 
Diagnostic kits may also be used to determine if an 

15 individual has receptors which bind OB protein, or those 
which, to varying degrees, have reduced binding capacity 
or ability. As stated infra , such antibodies may be 
prepared using immunogenic portions of an OB receptor 
protein. Such selective binding molecules may 

20 themselves be alternatives to OB protein, and may be 
formulated for pharmaceutical composition. 

Such proteins and/or nucleic acids may be used 
for tissue distribution assays (for example, as provided 
in the working example below) or for other assays to 

25 determine the location of OB receptor. 

The present OB receptor protein family may be 
. used in methods to obtain OB protein analogs, mimetics 
or small molecules. One would simply prepare a desired 
OB receptor protein, particularly one with capability, of 

30 binding to native OB protein, and assay the test 

molecule, which may be labelled with a detectable label 
substance, for ability to bind to such receptor. Other 
parameters, such as affinity, and location of binding, 
may also be ascertained . by methods available to those 

35 skilled in the art. For example, one could use portions 
of the present OB receptors, particularly portions in 
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the extracellular domain which are necessary for ligand 
binding, to determine the location of such binding. One 
could prepare OB receptors which have various 
truncations or deletions of regions of the extracellular 
5 domain which could be used to determine the location of 
test molecule binding. One could use an OB receptor 
known to be defective in native OB binding, such as 
potentially one from an individual having such defective 
receptors, and use this as the basis for ascertaining OB 
10 protein which would be effective to result in desired 
biological activity (i.e., weight loss, reduction in 
blood dyslipidemias or lowering of cholesterol levels, 
reduction in incidence or severity of diabetes) . Other 
uses include solely cosmetic uses for alteration of body 
15 appearance, particularly the removal of fat. 

The present OB receptor protein or nucleic 
acids may also be useful to identify substances which 
"up-regulate" OB protein or receptor. For instance, the 
temporal expression of OB receptor in vivo may be useful 
20 to determine if an administered substance causes an 

increase or decrease in OB receptor. One may conclude 
that an increase in OB receptor expression results in 
modultion of weight or lipid metabolism. 

The divergence in the C-terminus may represent 
25 OB receptors with different signal transduction 

abilities. Therefore the different receptor family 
members may be used for different assays, depending on 
the type of signal transduction observed. It is thought 
that at least a portion of the intracellular domain is 
30 necessary for signal transduction (see svprfl) • 

The following examples are offered to more 
fully illustrate the invention, but are not to be 
construed as limiting the scope thereof. 
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EXAMPLE 1; IDENTIFICATION OF HUMAN OB RECEPTOR PROTEIN 

Human OB receptor protein DNA was identified 
5 in a human liver cDNA library in two steps. The first 
step used two primers in polymerase chain reaction (PCR) 
to amplify a selected 300 base pair region from the 
human liver cDNA library. The second step used the PCR 
fragment as a probe to screen the human liver cDNA 

10 library. Thirteen clones were obtained, but these were 
incomplete at the 5' end. A procedure was performed to 
complete the 5' end to make complete clones. Twelve 
clones were sequenced. These twelve clones were 
identified as either "A", "B" or "C" as denoted by the 

15 C-terminus of the predicted amino acid sequence. 

-Polymerase Chain Ppprtjn n 

The original PCR primer was based on the 5' 
end and the 3 1 end of a 416 base pair sequence having 
20 GenBank Database Accession No. T7384 9. This sequence 
was selected on the basis of a known motif present in 
cytokine receptors, "WSXWS". 

The 5* primer had the sequence 73-96 of the 
416 bp sequence. The 3* primer had the sequence 337-360 
25 of the 416 bp sequence. 

These primers were used to probe a human cDNA 
liver library (Stratagene) . Standard methods were used. 

This resulted in a PCR fragment having the 
sequence 73-360 of the 416 bp fragment. 

30 

Hvbridi 7^jo n 

The -300 bp PCR fragment was used to probe a 
human liver cDNA library (Stratagene) using standard 
methods. This second hybridization resulted in 13 
35 positive clones. These were partial clones, incomplete 
at the 5 f end. 
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rnmplfition of the 5' end. 

Rapid Amplification of cDNA End ("RACE", kit, 
GIBCO/BRL) was used to obtain the full length clones. 

5 

Sequencing results. 

Sequencing revealed the three types of OB 
receptor DNAs. Of the thirteen clones, 4 clones were 
the "A" type (Seq. ID Nos. 1 and 2); 1 clone was the "B" 

10 type (Seq. ID Nos. 3 and 4) and 4 clones were of the "C" 
type (Seq. ID Nos. 5 and 6) . 

As can be seen from the Sequence 
Identifications (below), OB receptor A is 8 96 amino 
acids long, "B" is 904 amino acids long, and "C" is 958 

15 amino acids long. These different OB receptors are 
identical at amino acid positions 1-891, and diverge 
almost completely beginning at position 892.. The leader 
sequence is postulated to be, by hydrophobicity 
analysis, amino acids 1-21 (M-A) , 1-22 (M-F) or 1-28 (M-I) , 

20 with the mature protein beginning at positions 22 (F) , 
23 (N) or 29 (T) . Based on hydrophobicity analysis, the 
leader sequence is most likely to be at positions 1-21 (M 
through A) . Chinese Hamster Ovary Cell ("CHO") cell 
production of the secreted form of OB receptor protein 

25 also produced a protein having amino acid number 22 as 
the first amino acid of the mature protein. The 
transmembrane region is likely to begin at either 
position 840 (A) or 842 (L) through position 862(1), 
863(S) or 864 (H). For OB receptor type "A", the last 

30 amino acid is located at position 896 and is a lysine 
(L) . For OB receptor type "B", the last amino acid is 
located at position 904 and is a glutamine (Q) . For OB 
receptor type "C" , the last amino acid is located at 
position 958 and is glutamic acid (E) . 

35 For OB receptor protein type "C", the C- 

terminal region possesses high homology to a known human 
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transposable element. From nucleotide 2737 through 2947 
of the present human OB receptor protein type "C", there 
is a 98.1% homology with a 211 base section of a human 
retrotranspbsable element described in Ono et al., Nucl. 
5 Acids Res. 1£: 8725-8737 (1987) (bases 520 through 731, 
SINE-R11, GENBANK accession no. x07417) . 

EXAMPLE 2: TISSUE DISTRIBUTION 

10 Tissue distribution was ascertained using two 

methods. The first method involved using the entire 
type "A" OB receptor. The second method involved using 
probes which are specific to the C-terminal region of 
the protein. Since these C terminal regions are diver- 
15 gent, the second method detected the tissue distribution 
of the different members of the OB receptor family. 

The first method used a Northern Blot kit 
(Clontech) , using the entire type A OB receptor DNA as a 
probe. The second method used PCR with primers specific 
to the nucleic acids encoding the divergent C terminus 
of the three types. Standard methods were used. 

Table 2 shows the results for the Northern 
Blot and the PCR methods. The " + " indicates the 
investigator's subjective determination of the strength 
25 of signal. For the Northern Blot analysis, a triple 

"+++" indicates that a result (a dark "band" on the x- 
ray film) was seen upon overnight exposure of the film. 
A double "++" indicates that bands were seen at two 
weeks of exposure. A single "+" indicates that the 
bands were seen after three weeks of exposure. In 
addition, using this method, two molecular weights were 
observed, one at 4 Kb and one at 6.2 Kb. Although 
distribution was ubiquitous, the strongest signals were 
seen for ovary, heart and liver. For the PCR analysis, 
35 OB receptor "A" was seen in all tissue types tested 

(prostate, ovary, small intestine, heart, lung, liver 
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and skeletal muscle) , type "B" was seen only in lung and 
liver, and type "C" was seen in ovary, heart, lung and 
liver* 

5 Table 2 

Pisi-yilin^ion of the Novel QB Receptor 





Knrthern Blot 








4 Kb 


6.2 Kb 




A 


B 


C 


Spleen 




+ 










Thymus 




+ 










Prostate 




+ 




+ 






Testis 




+ 










Ovarv 




+ + + 




+ 




+ 


Small 
Intestine 




+ + 




+ 






Colon 














Peripheral 

blood 

Leukocyte 














Heart 




+ + + 




+ 




+ 


Brain 














Placenta 




+ 










Luna 


+ 






+ 


+ 


+ 


Liver 


+ + + 


+ + + 




+ 


+ 


+ 


Skeletal 
Muscle 




+ + 




+ 






Kidnev 




+ + 










Pancreas 




+ 











F.X AMPLE 3 : IDENTIFICATION OF HUMAN OB RECEPTOR GENOMIC 
DNA AND CHROMOSOME LOCALIZATION; IDENTIFICATION OF HUMAN 
OB RECEPTOR "D" 

15 

The full length human OB receptor genomic DNA 
was also prepared. OB receptor "A" cDNA, in its 
entirety, was used as a probe against a human genomic 
DNA library, using materials and methods from a 
20 commercially available kit (Genome Systems, using a 
human genomic library in a PI vector) . A single 



WO 97/25424 



PCT/US97/00128 



- 47 - 

positive clone was detected. There are introns located 
at (with respect to OB receptor "A" DNA) base pair 
number: 559, 1059, 1350, 1667, 1817, 1937, 2060, 2277, 
2460, 2662, and 2738. 
5 The human OB receptor gene was localized to 

human chromosome 1P31 by FISH analysis (Genome Systems) . 
Human chromosome 1 is thought to correspond to mouse 
chromosome 4C7, which is presumed to be the location of 
the db locus. 

10 A further chromosomal sequence was isolated. 

This chromosomal DNA sequence was isolated from a human ■ 
genomic library as described above. This chromosomal 
sequence encodes what is here denominated human OB 
receptor "D", and the encoded amino acid sequence is set 

15 forth in SEQ. ID No. 7. A cDNA encoding this amino acid 
sequence is set forth in SEQ. ID No. 8. The chromosomal 
DNA intron/exon junction map is set forth as SEQ. ID 
No. 9. 

As with forms "A", "B", and "C", for the 
20 present form "D" OB receptor protein, the first amino 
acid of the mature protein is likely (using 
hydrophobicity analysis) to begin at position 22 (F) , 23 
(N) or 29 (T) . The last amino acid of the protein is at 
position 1165 and is a valine residue. As with the 
25 other forms, the extracellular domain extends from 

position 22 (F) , 23 (N) or 29 (T) to position 839 (D) or 
841 (G) . The transmembrane domain appears to begin at 
position 840 (A) or 842 (L) . The end of the 
transmembrane domain appears to be located at position 
30 862 (I), 863 (S) or 864 (H) . The C-terminal region, 
beyond the transmembrane region, is likely to be 
involved in signal transduction, and is located at 
position 863 (S) , 864 (H) or 865 (Q) through position 
11.65 (V) . 

35 The present OB receptor form "D" is identical 

to that published by Tartaglia et al, Cell £2: 1263-1271 
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(December 29, 1995) with the exception of a single amino 
acid change at amino acid position 976 (nucleotide codon 
begining at position 3022) . The present type "D" amino 
acid at position 976 is aspartic acid, and the published 
5 amino acid corresponding to the same position is 

alanine. This is a non- conservative substitution, see 
infra , and since the location of the substitution is 
within a region thought important for signal trans- 
duction, this change could affect the function of the 
10 molecule. 

EXAMPLE 4 : PREPARATION OF SOLUBLE OB RECEPTOR 

Three forms of soluble human OB receptor have 
IS been prepared: 

1. Leader + Extracellular Domain (Seq v 
ID Nos. 10 and 11) : A recombinant form of the soluble 
human OB receptor was prepared- This form encompasses, 
in the immature protein, the leader sequence and the 

20 extracellular domain (amino acids 1-839) . The mature 

protein would have the leader sequence deleted, and the 
first amino acid of the mature recombinant soluble human 
OB receptor would be 22 (F) , 23 (N) or 29 (T) . This 
protein was expressed as described below. 

25 2. Leader + Extracellular Domain 4- C- 

terminal FLAG (Seq. ID No, 12) : A second form of the 
recombinant soluble human OB receptor was also prepared. 
This form had a "FLAG" tag located at the "C" terminus 
of the protein. The "FLAG" peptide is a useful research 

30 tool as it allows one to follow the protein using an 
antibody which recognizes the "FLAG" peptide. Such 
reagents are commercially available (IBI, New Haven, 
CT) . This protein was expressed as described below. 

3. Native Sp line Variant (Seq. ID Nos. 

35 13 and 14) : This form is believed to the the 

recombinant form of a naturally occurring secreted, 
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soluble human OB receptor. This form has most of the 
amino acids found in the extracellular domain (amino 
acids 22-798) , and a unique 6 amino acid sequence at the 
carboxyl terminus. Beginning at amino acid position 799 
of Seq. ID No. 13, the amino acid sequence of this 
native splice variant human OB receptor protein is "G K 
F T I L." 

F,y AMPLE 5 : PREPARATION OF EXPRESSION VECTORS 



Recombinant human OB receptor expression 
vectors have been prepared for expression in mammalian 
cells. As indicated above, expression may also be in 
non-mammalian cells, such as bacterial cells. The type 
15 "A" cDNA (Seq. ID No. 2) was placed into a commercially 
available mammalian vector ( P CEP4, Invitrogen) for 
expression in mammalian cells, including the commercial- 
ly available human embryonic kidney cell line, "293". 

Recombinant human OB receptor expression 
20 vectors have been prepared for expression of recombinant 
soluble OB receptor, consisting of the leader sequence 
and the extracellular domain (Seq. ID Nos. 10 and 11), 
using the same system as above (the commercialy 
available mammalian vector P CEP4, and "293" cells) . 
25 This recombinant soluble human OB receptor was also 
expressed in CHO cells in a similar way. 

The "FLAG-tagged" form (Seq. ID No. 12) of the 
recombinant soluble human OB receptor, and the "D" form 
(Seq. ID No. 7) were also expressed in "293" cells in a 
30 similar fashion as above. 

Detection of desired protein was accomplished 
using BIACORE (Pharmacia) analysis. This analysis is 
analogous to that described in Bartleyet al., Nature 
368 : 558-560 (1994) . 
35 Essentially, the BIACORE machine measures 

affinity interactions between two proteins. In this 
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case, the OB protein was immobilized on the machine, and 
conditioned media from cell lines expressing the OB 
receptor was added to the machine. Any receptor protein 
present in the conditioned media bound to the OB protein 
5 surface. The BIACORE machine gave a read-out indicating 
that receptor protein was being expressed. For 
recombinant soluble receptor (Seq. ID No. 10) expression 
in "293" cells, the read-out was 191.0 relative to a 
baseline readout of 0. For recombinant soluble receptor 
10 <SEq. ID No. 10) expression in CHO cells, the read-out 
was 150.9 relative to a baseline readout of 0. For 
recombinant soluble receptor with a C-terminal FLAG-tag 
(Seq. ID. No. 12), the read-out was 172.0 relative to a 
baseline of 0. 

15 For expression in bacterial cells, one would 

typically eliminate that portion encoding the leader 
sequence (e.g., potentially amino acids 1-21, 1-22 or 1- 
28) . One may add an additional methionyl at the 
N-terminus for bacterial expression. Additionally, one 

20 may substitute the native leader sequence with a 
different leader sequence, or other sequence for 
cleavage for ease of expression. 

EXAMPLE 6 : DEMONSTRATION OF SIGNAL TRANSDUCTION 

25 

This example demonstrates that the "D" form is 
active to produce a signal within a cell, whereas in the 
same cell type, the "A" form does not. The signal 
transduction assay was performed by the use of "2 93" 

30 cells transiently expressing either the "A" or the "D" 
form (see above for preparation of the "293" expression 
clones). Phosphorylation of molecules predicted to be 
involved in signal transduction within the cell was 
examined upon OB protein binding to the OB receptor 

35 protein tested. The results demonstrate that upon 

binding of OB protein to the extracellular domain, the 
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EXAMPLE 7 : USE OF SOLUBLE OB RECEPTOR AS A THERAPEUTIC 

This example demonstrates that soluble OB 
receptor protein acts to protect the activity of OB 
protein. Below, soluble OB receptor and/or OB protein 
was delivered to a mammal via "gene transplant" — that 
is, via bone marrow cells engineered to express the 
desired DNAs. When soluble OB receptor combined with OB 
protein was delivered, the animals lost more weight than 
delivery of OB protein alone. This demonstrates the 
protective activity of OB receptor protein. 

While not wishing to be bound by theory, one 
explanation of the mode of action is that soluble OB 
15 receptor protein acts to protect the OB protein in serum 
from agents or conditions which could diminish its 
activity* The protective action appears to increase 
circulating half-life of the protein. As such, the 
present example demonstrates that OB receptor either 
alone, or administered as a complex with OB protein (or 
analog or derivative thereof) could act as a therapeutic, 
agent. 

Materials and m^ft^o. 

25 1- Preparation of recombinan t ob retroviral 

vector Packing CelTc; 

Use of murine r>h> rPNn ■ Full length wild-type 
murine ob cDNA was amplified by the PCR using synthetic 
oligonucleotides designed from the published sequence 
30 Zhang et al., Nature 372; 425-432 (1994) .Linkers (An Ecp. 
RI linker and a £gl n linker) were used to facilitate 
' subcloning. 

Use Of soluble recombinant human o q re^pfnr 
£PiI&. Methods similar to those above were used. A 
35 construct containing the recombinant human soluble 

receptor of Seq. ID No. 10 was used, and modified with 



20 
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linkers to facilitate cloning (i.e., the addition of a 
Bal II restriction endonuclease recognition site) • 

Plarpmfint of desired cDNA into vector. PGR 
products were digested with EcoRI and Sgrlll and cloned 
5 into similarly-digested parental vector (pMSCV2.1) under 
the transcriptional control of the viral LTR promoter. 
The parental MSCV vector (supplied by R. Hawley, 
University of Toronto, Canada) was derived from MESV 
(murine embryonic stem cell virus) and contains a 

10 neomycin phosphotransferase resistance (neo r ) gene 
driven by an internal mouse phosphoglycerate kinase 
(PGK) promoter, as described- Hawley, et al, J. Exp. 
Med. 176 : 1149 -1163 (1992) . The parental plasmid 
pMSCV2.1 and pMSCV-OB were independently electroporated 

15 into the GP+E-86 packaging cell line (supplied by Dr. A. 
Bank, Columbia University, NY) Markowitz et al.. , J. 
Virol. £2.: 1120-1124 (1988) . Transient supernatants were 
harvested from electroporated populations and used to 
infect tunicamycin treated parental GP+E-86 cells. 

20 Tunicamycin treatment relieves the block to 

superinfection of the parental packaging cells. G418 
(0.78 mg/mL, 67% active, GIBCO Laboratories, Life 
Technologies, Inc., Grand Island, NY) resistant clones 
were selected from each infected population and titered 

25 by infection of NIH3T3 cells. Clones with the highest 
G418 resistant titer were expanded and frozen as 
aliquots. Each bone marrow infection and 
transplantation experiment used aliquots from the same 
passage of frozen viral packaging cells. Both the 

30 parental and ob packaging cell lines were tested for the 
presence of, and found to be free from, replication 
competent virus using a sensitive marker rescue assay. 
Moore, et al . , (1993) in: Gene Targeting: A Practical 
Approach, Joyner, Ed. (Oxford University Press, New 

35 York, NY) . 
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2. Production of Retroviral s«ppmaf^ ffi 
Recombinant virus -producing packaging cell lines were 
grown in 175cm2 tissue culture flasks in Iscove's 
Modified Dulbecco's Medium (IMDM) (GIBCO) , 10% (v/v) 

5 FBS, at 37°C. Sub-confluent (approximately 60%) 

monolayers of cells were fed with fresh medium 24h prior 
to harvest of virus-containing supernatants . Viral 
supernatant s were removed from packaging cell lines by 
aspiration, sterile filtered <0.45mM) and added directly 
10 to bone marrow cultures. Fresh aliquots of frozen 
packaging cell lines were thawed for use in each 
experiment . 

3. Bone Marrow Infection and Transplantation. 
Eight to 12-week old female C57BL/6J (+/ + ) or {obVob) 

15 mice were used as bone marrow donors and recipients. 

All mice were purchased from The Jackson Laboratory (Bar 
Harbor, ME) and housed under specific pathogen- free 
conditions in a vivarium in accordance with governmental 
regulations and institutional guidelines. 
20 Bone marrow cells were harvested from femurs 

and tibias of donor mice 4 days post 5-f luorouracil (5- 
FU, Sigma Chemical Co., St. Louis, MO) treatment (150 
mg/kg i.v. ). Bone marrow cells (6 X 10 5 /mL) were 
incubated in 150mm tissue culture dishes (30mL/dish) 
25 containing fresh viral .supernatant (as described above), 
15% FBS, 6 mg/mL polybrene (Sigma), 0.1% bovine serum 
albumin (BSA, Fraction V, Sigma), 2.5 ng/mL recombinant 
mouse IL-3 --(rmIL-3) , 100 ng/mL each of recombinant human 
IL-6 (rhIL-6), recombinant human IL-11 (rhIL-11) , and 
30 recombinant rat SCF (rrSCF) . All growth factors were 
produced by Amgen, Inc. (Thousand Oaks, CA) . Culture 
media were replaced daily for 3 days with fresh virus- 
containing supernatant and growth factors. 

At the end of the infection period, total non- 
35 adherent and adherent cells were washed and resuspended 
in 1% BSA-saline and transplanted into g-irradiated (12 
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Gy, Cs 137 ) mice. Each animal was transplanted with 2.5 
X 10 6 syngeneic cells. There were approximately 10 

animals per cohort. 

4 / Analysis of OR protein expression in 
5 fransfertfiri rfill^ and transplanted animals. For 

trans fected bone marrow cells, Western analysis was 
performed. Vector packaging cell supernatant was 
resolved by SDS-PAGE (16% acrylamide) , then transferred 
to Hybond-ECL (Amersham, Arlington Heights, IL) . The 
10 filter was incubated with affinity-purified rabbit a- 
mouse OB protein polyclonal antibody (Img/mL) in T-TBS 
buffer <20mM Tris-chloride, pH7.6, 137mM NaCI, 0.1% 
Tween20) at room temperature for 45 min. Horseradish 
peroxidase (HRP) -con jugated donkey a-rabbit IgG 
15 (Amersham) was diluted in T-TBS (1:2500) and incubated 
with the filter at room temperature for 45 min. 
Enhanced chemi luminescence (ECL, Amersham) detection was 
performed as recommended by the manufacturer. 

For transplanted animals, serum was analyzed. 
20 Animals were bled retroorbitally, under isofluorane 

anesthesia. Serum from transplanted ob/ob animals was 
resolved by SDS-PAGE (4-20% acrylamide) under non- 
reducing and reducing conditions, then transferred to 
Trans-Blot (Bio-Rad Laboratories, Hercules, CA) 
25 membranes. The membranes were incubated for 2 hours at 
room temperature with HRP-con jugated rabbit a-mouse OB 
protein antibody (0.125mg/mL) in T-TBS buffer containing 
5% fetal bovine serum and 1% bovine serum albumin. 
Bound OB protein was detected by ECL (Amersham) , 
30 performed as recommended by the manufacturer. 

For quantitation of soluble OB protein levels, 
serum from transplanted animals was subjected to ELISA 
analysis. Briefly, affinity-purified rabbit a-OB 
protein polyclonal antibody was coated onto 96-well 
35 plates. Standards (purified recombinant OB protein 
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monomer, Pelleymounter et al., Science 233.: 54 0-543 
(1995) and experimental samples were added, and the 
plates were incubated at room temperature. The plates 
were washed 'twice and affinity-purified rabbit a-OB 
protein antibody conjugated to horseradish peroxidase 
was added. Following incubation at room temperature, 
the plates were washed four times with TNE-Tween20. 
TMB/peroxide substrate was added and the color reaction 
was read at 450nm in a Molecular Devices plate reader. 
OB protein concentrations in sera were estimated by 
comparison to a standard curve prepared from internal 
standards. OB protein levels were reliably measured in 
samples containing >160 pg/mL. 

5- Body Weight and Food Tnf»if» 

Mi ce were 

15 offered pelletized rodent chow (PMI Feeds, Inc., St. 
Louis, MO) ad Ubimm. The body weight of individual 
animals was measured daily for the first two months of 
analysis, and weekly thereafter. Food consumption was 
measured daily on selected groups of individually-housed 

20 animals. 



Result., 6 ? 

Results are presented in Tables 4 and 5 below. 
Administration of OB protein receptor increased the 

25 effectiveness of OB protein. This may have been 

accomplished via an increased circulation time of OB 
protein in the presence of OB protein receptor. 

As can be seen in the Table, animals 
administered a combination of OB protein and OB protein 

30 receptor (via genetic therapy) had a greater weight loss 
after 28 days than either composition alone. The Table 

presents the results of two experiments (" / ■•) . as 

can be seen, use of the OB protein alone at day 40 
resulted in animals with 87.5% and 72.2% of the starting 

35 weight. Using OB receptor in combination with OB 
protein, however, resulted in animals with 68% and 
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53.6% of the starting weight. Use of the receptor alone 
appeared to have little effect, if any. 

TABLE 4 



Treatment 


Weight (g) 
decrease at 
day 28 (ave) 


% starting 
weight 
(ave) 
dav 28 


% starting 
weight 

(ave) 
day 40 


OB alone* 


6.3/12.7 


87.9/75.3 


87.5/72.2 


Receptor** 
alone 


[1.4] / [0.3] 


103/100.6 


104.2/101.7 


OB + 
Receptor*** 


12.6/16.8 


76.3/67.5 


68/53.6 



* 50% bone marrow cells transfected with OB protein 
cDNA as described above, and 50% bone marrow cells 
without genetic alteration 

** 50% bone marrow cells transfected with OB receptor 
10 protein cDNA as described above, and 50% bone marrow 
cells without genetic alteration 

*** 50% bone marrow cells transfected with OB protein 
cDNA as described above, and 50% bone marrow cells 
transfected with OB receptor protein cDNA as described 
15 above. 

Table 5, below, contains results of the OB 
levels found in the serum from animals administered OB 
protein alone, or administered OB protein in combination 
with OB protein receptor (via the "gene therapy" method 
20 of this example) . The data reflect nanograms of OB 
protein per milliliter of serum, plus or minus the 
standard error of the mean. 
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TABLE 5 



10 



15 



20 



25 



Treatment 


Experiment #1* 


Experiment #2** 


OB alone* 


2.93 +/- 0.77 


9.74 +/- 1.02 


Receptor** 
alone 


0.08 +/- 0.05 


0.12 +/- 0.07 


OB + 
ReceDtor*** 


12.11 +/- 1.90 


15.18 +/- 2.52 



30 



50% bone marrow cells trans fected with OB protein 
cDNA as described above, and 50% bone marrow cells 
without genetic alteration 

** 50% bone marrow cells transfected with OB receptor 
protein cDNA as described above, and 50% bone marrow 
cells without genetic alteration 

*** 50% bone marrow cells transfected with OB protein 
cDNA as described above, and 50% bone marrow cells 
transfected with OB receptor protein cDNA as described 
above . 

* Experiment #1 was conducted as described above, 
with OB protein serum levels measured after 38 days. 
tt Experiment #2 was also conducted as described 
above, with OB protein serum levels measured after 24 
days . 

The data demonstrate the protective effects of 
OB receptor. As can be seen, in the presence of OB 
receptor, OB protein has a higher accumulation in the 
serum. The degree of accumulation is observed to 
increase inversely with the levels of OB protein in the 
serum. in Experiment #1 (with a base OB protein level 
of about 2.93 ng/ml) , the OB protein serum level 
increased about 400% with the addition of receptor, 
where in Experiment #2 (with a base of about 9.74), the 
OB protein serum level increased by about 25%. 

OB receptor administered either alone or in 
association with OB protein (or analogs or derivatives 
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thereof) may serve to increase the circulation time of 
OB protein, and therefore enhance the therapeutic 
efficacy of either exogenous or endogenous OB protein. 

5 F.XAMPLE 8 : PREPARATION OF SELECTIVE BINDING MOLECULES 
Animals were immunized for the preparation of 
polyclonal antibodies using the following peptides (with 
respect to the numbering of the amino acids for OB 
receptor A, Seq. ID No. 1) : 54-64; 91-100; 310-325; 
10 397-406; 482-496; 874-885; and, with respect to amino 

acids of OB receptor "C" (Seq. ID No. 5), 910-929. Some 
of the polyclonal antibodies prepared (in rabbits) were 
tested for ability to bind to recombinant human OB 
receptor protein. The polyclonal antibody prepared 
15 against amino acids 54-64 was found to have the highest 
affinity for recombinant human OB receptor protein. The 
polyclonal antibody prepared against amino acids 397-406 
was also found to bind to recombinant human OB receptor 
protein. The polyclonal antibody prepared against amino 
20 acids 91-100 was found to slightly bind to recombinant 
human OB receptor protein. The polyclonal antibody 
prepared against amino acids 874-885 was found not to 
bind to recombinant human OB receptor protein. 

An additional study was performed which 
25 demonstrates the expression and purification of the 

extracellular domain of the OB receptor protein in CHO 
cells, and antibodies which recognize this OB protein 
receptor extracellular domain. 

The extracellular domain of the human OB 
30 receptor protein was expressed as a secreted, soluble 
protein in CHO cells as previously described supra. 
Individual cell lines were isolated and grown in 
increasing amounts of methotrexate to increase 
selection/expression of the recombinant receptor protein 
35 (100, 200 or 500 micrograms methotrexate per ml of 

media) . Conditioned media from the CHO cell lines was 
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collected, and the proteins in the conditioned media 
„ere fractionated by SDS-BAGE. The OB ~«P« r 
extracellular domain migrated as a broad band w than 
apparent size range of about 140 *Da to about 200 KDa. 
s The OB receptor protein extracellular domain was 

detected by Western Blot analysis -^^tracel- 
antibodies prepared against a ^^ded, 

^ ^J^^ST-r^r- antigen to 
10 nttrra in rabbits. The identified OB receptor 

Intracellular domain was purified by affinity 
chromatography. The purified protein was sequenced at 
the amino terminus to confirm that it was the » 
receptor and also to determine the start of the mature 
15 protein .after signal peptide cleavage, as expresed 
rHO cells It was found that amino acid no. 22 

rrr^r: 

protein as expressed in CHO cells. 
,„ other immunogenic peptides may be used. 

Polyclonal, monospecific polyclonal, monoclonal, 
TntlLdy fragments, and recombinant \ he 
prepared using methods available to those silled 

25 art ' one may further use recombinant technics or 

peptide synthesis methods to alter the = 
^Crrr^L having altered 

30 usinHuLn -c constant, region. Other 

types of recombinant antibodies, for example, 

yp w-,^^^ a-Ffinitv or selectivity 

having CDR' s altered to enhance affxnrty 
to one more members of the OB receptor family, may b 
35 prepared' and used using methods available to those 
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skilled in the art. See Winter et al., Nature 349: 293- 
299 (1991) . 

The present OB receptor protein may be used as 
an assay to screen for desired selective binding 
5 molecules. Such assay may be based on binding 

capability, or biological activity, or, other means of 
detecting signal transduction. For example, if one were 
to prepare a series of modified antibodies, one could 
test them for affinity (i.e, binding strength) against 

10 the target OB receptor. 

The selective binding molecules may be useful 
for diagnostic purposes, such as tissue distribution 
analysis, or to diagnose the relative affinity of an 
individual's OB receptors for such selective binding 

15 molecule to determine the functionality of an 

individual's OB receptor during a course of therapy. 
Selective binding molecules may be alternative 
therapeutic or cosmetic products to OB protein. 

20 EXAMPLE 9 r CTNR THERAPY 

One may deliver the present OB receptor 
protein via gene therapy, as described jnt X$ ■ 

One may envision, using materials and methods 
available to those skilled in the art and provided 

25 herein, using T-cells as an agent carrying DNA 

expressing OB receptor for gene therapy. An individual 
would have T-cells selected using CD34+ selection and a 
magnetic microparticles selection device. Such cells 
would be transfected with the desired DNA, or the 

30 regulation of the desired coding region may be altered 
using homologous recombination or other in situ 
techniques. The transduced cells could be selected 
empirically, using means to detect the desired protein, 
or a marker may be included which permits indirect 

35 detection (i.e., a selectable marker as is known in the 
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. „«. t - could be expanded, for 
««. optionally. ^ o ^ t f . eeor . such as SC. or 

example, usin, one or ^ stored £or 

-•TTi rr «a -procedure woul. only 

:rrai: ~~ - - 

10 loss/maintenance of weight, diabetes 
lipid levels, or other conditions. 

^^^^ 
15 th ose to which reference has been - aster.cM 

indicates the position of a stop codon. 
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Hntnan OB Rpcapt.or "A" Ami no Arid Science (Sea. ID No ■ 1 (Amino 
a^iri. < s infl1*» Igf.tgr abbreviation); 





1 


MICQKFCWL 


LHWEFIYVIT 


AFNLSYPITP 


WRFKLSCMPP 


NSTYDYFLLP 


5 


51 


AGLSKNTSNS 


NGHYETAVEP 


KFNSSGTHFS 


NLSKTTFHCC 


FRSEQDRNCS 




101 


LCADNIEGKT 


FVSTVNSLVF 


QQIDANWNIQ 


CWLKGDLKLF 


ICYVESLFKN 


10 


151 


LFRNYNYKVH 


LLYVLPEVLE 


DSPLVPQKGS 


FQMVHCNCSV 


HECCECLVPV 




201 


PTAKLNDTLL 


MCLKITSGGV 


IFQSPLMSVQ 


PINMVKPDPP 


LGLHMEITDD 





251 


GNLKISWSSP 


PLVPFPLQYQ 


VKYSENSTTV 


IREADKIVSA 


TSLLVDSILP 


15 


301 


GSSYEVQVRG 


KRLDGPGIWS 


DWSTPRVFTT 


QDVIYFPPKI 


LTSVGSNVSF 




351 


HCIYKKENKI 


VPSKEIVWWM 


NLAEKIPQSQ 


YDWSDHVSK 


VTFFNLNETK 


20 


401 


PRGKFTYDAV 


YCCNEHECHH 


RYAELYVIDV 


NINISCETDG 


YLTKMTCRWS 




451 


TSTIQSLAES 


TLQLRYHRSS 


LYCSDIPSIH 


PISEPKDCYL 


QSDGFYECIF 


25 


501 


QPIFLLSGYT 


MWIRINHSLG 


SLDSPPTCVL 


PDSWKPLPP 


SSVKAEITIN 


551 


IGLLKISWEK 


PVFPENNLQF 


QIRYGLSGKE 


VQWKMYEVYD 


AKSKSVSLPV 




601 


PDLCAVYAVQ 


VRCKRLDGLG 


YWSNWSNPAY 


TWMDIKVPM 


RGPEFWRIIN 


30 


651 


GDTMKKEKNV 


TLLWKPLMKN 


DSLCSVQRYV 


INHHTSCNGT 


WSEDVGNHTK 




701 


FTFLWTEQAH 


TVTVLAINSI 


GASVANFNLT 


FSWPMSKVNI 


VQSLSAYPLN 


35 


751 


SSCVIVSWIL 


SPSDYKLMYF 


.IIEWKNLNED 


GEIKWLRISS 


SVKKYYIHDH 


801 


FIPIEKYQFS 


LYPIFMEGVG 


KPKIINSFTQ 


DDIEKHQSDA 


GLYVIVPVII 




851 


SSSILLLGTL 


LISHQRMKKL 


FWEDVPNPKN 


CSWAQGLNFQ 


KRTDIL*SLI 


40 


901 


MITTDEPNVP 


TSQQSIEY*K 


IFTF*RRGAN 


LKKIQLNF*E 


LTYGGLC*FR 




951 


T*NRCVNLGS 


KCRFESSLDV 


*L 
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1 CCGCCGCCAT CTCTGCCTTC GGTCGAGTTG GACCCCCGGA TCAAGGTGTA 

5 51 CTTCTCTGAA GTAAGATGAT TTGTCAAAAA TTCTGTGTGG TTTTGTTACA 
101 TTGGGAATTT ATTTATGTGA TAACTGCGTT TAACTTGTCA TATCCAATTA 
iq 151 CTCCTTGGAG ATTTAAGTTG TCTTGCATGC CACCAAATTC AACCTATGAC 
201 TACTTCCTTT TGCCTGCTGG ACTCTCAAAG AATACTTCAA ATTCGAATGG 
251 ACATTATGAG ACAGCTGTTG AACCTAAGTT TAATTCAAGT GGTACTCACT 
15 301 TTTCTAACTT ATCCAAAACA ACTTTCCACT GTTGCTTTCG GAGTGAGCAA 
351 GATAGAAACT GCTCCTTATG TGCAGACAAC ATTGAAGGAA AGACATTTGT 
2Q 401 TTCAACAGTA AATTCTTTAG TTTTTCAACA AATAGATGCA AACTGGAACA 
451 TACAGTGCTG GCTAAAAGGA GACTTAAAAT TATTCATCTG TTATGTGGAG 
501 TCATTATTTA AGAATC TAT T CAGGAATTAT AACTATAAGG TCCATCTTTT 
55 551 ATATGTTCTG CCTGAAGTGT TAGAAGATTC ACCTCTGGTT CCCCAAAAAG 
601 GCAGTTTTCA GATGGTTCAC TGCAATTGCA GTGTTCATGA ATGTTGTGAA 

Q 651 TGTCTTGTGC CTGTGCCAAC AGCCAAACTC AACGACACTC TCCTTATGTG 

701 TTTGAAAATC ACATCTGGTG GAGTAATTTT CCAGTCACCT CTAATGTCAG 

751 TTCAGCCCAT AAATATGGTG AAGCCTGATC CACCATTAGG TTTGCATATG 

5 801 GAAATCACAG ATGATGGTAA TTTAAAGATT TCTTGGTCCA GCCCACCATT 

851 GGTACCATTT CCACTTCAAT ATCAAGTGAA ATATTCAGAG AATTCTACAA 

3 901 CAGTTATCAG AGAAGCTGAC AAGATTGTCT CAGCTACATC CCTGCTAGTA 

951 GACAGTATAC TTCCTGGGTC TTCGTATGAG GTTCAGGTGA GGGGCAAGAG 

1001 ACTGGATGGC CCAGGAATCT GGAGTGACTG GAGTACTCCT CGTGTCTTTA 

5 1051 CCACACAAGA TGTCATATAC TTTCCACCTA AAATTCTGAC AAGTGTTGGG 

1101 TCTAATGTTT CTTTTCACTG CATCTATAAG AAGGAAAACA AGATTGTTCC 

1151 CTCAAAAGAG ATTGTTTGGT GGATGAATTT AGCTGAGAAA ATTCCTCAAA 

1201 GCCAGTATGA TGTTGTGAGT GATCATGTTA GCAAAGTTAC TTTTTTCAAT 

1251 CTGAATGAAA CCAAACCTCG AGGAAAGTTT ACCTATGATG CAGTGTACTG 



WO 97/25424 
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1301 CTGCAATGAA CATGAATGCC 

1351 ATGTCAATAT jCAATAT CTCA 

5 

1401 ACTTGCAGAT GGTCAACCAG 

1451 GCAATTGAGG TATCATAGGA 

10 1501 TTCATCCCAT ATCTGAGCCC 

1551 TATGAATGCA TTTTCCAGCC 

1601 GATTAGGATC AATCACTCTC 

15 

1651 TCCTTCCTGA TTCTGTGGTG 

1701 GAAATTACTA TAAACATTGG 

20 1751 CTTTCCAGAG AATAACCTTC 

1801 AAGAAGTACA ATGGAAGATG 

1851 GTCAGTCTCC CAGTTCCAGA 

25 

1901 CTGTAAGAGG CTAGATGGAC 

1951 CCTACACAGT TGTCATGGAT 

30 2001 TGGAGAATAA TTAATGGAGA 

2051 ACTTTGGAAG CCCCTGATGA 

2101 ATGTGATAAA CC AT CAT AC T 

35 

2151 GGAAATCACA CGAAAT TC AC 

2201 TACGGTTCTG GCCATCAATT 

40 2251 TAACCTTTTC ATGGCCTATG 

2301 GCTTATCCTT TAAACAGCAG 

2351 CAGTGATTAC AAGCTAATGT 

45 

2401 AAGATGGTGA AATAAAATGG 

245,1 TATATCCATG ATCATTTTAT 

50 2501 CCCAATATTT ATGGAAGGAG 

2551 CTCAAGATGA TATTGAAAAA 



66 - 

ATCATCGCTA TGCTGAATTA TATGTGATTG 
TGTGAAACTG ATGGGTACTT AACTAAAATG 
TACAATCCAG TCACTTGCGG AAAGCACTTT 
GCAGCCTTTA CTGTTCTGAT ATTCCATCTA 
AAAGATTGCT ATT TGCAGAG TGATGGTTTT 
AATCTTCCTA TTATCTGGCT ACACAATGTG 
TAGGTTCACT TGACTCTCCA CCAACATGTG 
AAGCCACTGC CTCCATCCAG TGTGAAAGCA 
AT TAT T G AAA ATATCTTGGG AAAAGCCAGT 
AATTCCAGAT TCGCTATGGT TTAAGTGGAA 
TATGAGGTTT ATGATGCAAA ATCAAAATCT 
CTTGTGTGCA GTCTATGCTG TTCAGGTGCG 
TGGGATATTG GAGTAATTGG AGCAATCCAG 
ATAAAAGTTC CTATGAGAGG ACCTGAATTT 
TACTATGAAA AAGGAGAAAA ATGTCACTTT 
AAAATGACTC ATTGTGCAGT GTTCAGAGAT 
TCCTGCAATG GAACATGGTC AGAAGATGTG 
TTTCCTGTGG ACAGAGCAAG CACATACTGT 
CAATTGGTGC TTCTGTTGCA AATTTTAATT 
AGCAAAGTAA ATATCGTGCA GTCACTCAGT 
TTGTGTGATT GTTTCCTGGA TACTATCACC 
ATTTTATTAT TGAGTGGAAA AATCTTAATG 
CTTAGAATCT CTTCATCTGT TAAGAAGTAT 
CCCCATTGAG AAGTACCAGT TCAGTCTTTA 
TGGGAAAACC AAAGATAATT AATAGTTTCA 
CACCAGAGTG ATGCAGGTTT ATATGTAATT 
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2601 




1 lArTTCCTC 


T TCCAT C T T A 


TTGCTTGGAA 


CATTATTAAT 




96S1 




ACaAATGAAAA 


AGCTATTTTG 


GGAAGATGTT 


CCGAACCCCA 








CTGGGCACAA 


GGACTTAATT 


TTCAGAAGAG 


AACGGACATT 




07C1 

j£ / -J X 




T AAT CATGAT 


CACTACAGAT 


GAACCCAATG 


TGCCAACTTC 


10 






ATAGAG TATT 


AGAAGATTTT 


TACATTTTGA AGAAGGGGAG 




LAAAl^ 1 AAA 


AAAAAT TCAG 


TTGAACTTCT 


GAGAGTTAAC 


ATATGGTGGA 




2901 


TTATGTTGAT 


TTAGAACTTA 


AAATAGATGT 


GTAAATTTGG 


GTTCAAAATG 


15 


2951 


TAGATTTGAG 


TCCAGTTTGG 


ATGTGTGATT 


AATTTTCAAA 


TCATCTAAAG 




3001 


TTTAAAAGTA 


GTATTCATGA 


TTTCTGGCTT 


TTGATTTGCC 


ATATTCCTGG 


20 


3051 


TCATAAAACA 


TTAAGAAAAT 


TATGGCTGTT 


GCTGTCATTA 


CATATCTATT 


3101 


AAAT GTCATC 


AAATATGTAG 


TAGACAATTT 


TGTAATTAGG 


TGAACTCTAA 




3151 


AACTGCAACA 


TCTGACAAAT 


TGCTTTAAAA ATACAATGAT 


TAT 
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fluman OB Receptor "B" Amino Acid Sequence (Sea. ID No. ? ( Amino 



10 



15 



20 



25 



30 



35 



40 



Acid) ) : 










1 


MICQKFCWL 


LHWEFIYVIT 


AFNLSYPITP 


WRFKLSCMPP 


NSTYDYFLLP 


51 


AGLSKNTSNS 


NGHYETAVEP 


KFNSSGTHFS 


NLSKTTFHCC 


FRSEQDRNCS 


101 


LCADNIEGKT 


FVSTVNSLVF 


QQIDANWNIQ 


CWLKGD LKLF 


ICYVESLFKN 


151 


LFRNYNYKVH 


LLYVLPEVLE 


DSPLVPQKGS 


FQMVHCNCSV 


HECCECLVPV 


201 


PTAKLNDTLL 


MCLKITSGGV 


IFQSPLMSVQ 


PINMVKPDPP 


LGLHMEITDD 


251 


GNLKISWSSP 


PLVPFPLQYQ 


VKYSENSTTV 


IREADKIVSA 


TSLLVDSILP 


301 


GSSYEVQVRG 


KRLDGPGIWS 


DWSTPRVFTT 


QDVIYFPPKI 


LTSVGSNVSF 


351 


HCIYKKENKI 


VPSKEIVWWM 


NLAEKIPQSQ 


YDWSDHVSK 


VTFFNLNETK 


401 


PRGKFTYDAV 


YCCNEHECHH 


RYAELYVIDV 


NINISCETDG 


YLTKMTCRWS 


451 


TSTIQSLAES 


TLQLRYHRSS 


LYCSDIPSIH 


PISEPKDCYL 


QSDGFYECIF 


501 


QPIFLLSGYT 


MWIRINHSLG 


SLDSPPTCVL 


PDSWKPLPP 


SSVKAEITIN 


551 


IGLLKISWEK 


PVFPENNLQF 


QIRYGLSGKE 


VQWKMYEVYD 


AKSKSVSLPV 


601 


PDLCAVYAVQ 


VRCKRLD GLG 


YWSNWSNFAY 


TVVMDIKVPM 


T5 r* T3 IT* CT.TO T TM 

KGP ErWRI I N 


651 


GDTMKKEKNV 


TLLWKPLMKN 


DSLCSVQRYV 


INHHTSCNGT 


WSEDVGNHTK 


701 


FTFLWTEQAH 


TVTVLAINSI 


GASVANFNLT 


FSWPMSKVNI 


VQSLSAYPLN 


751 


SSCVIVSWIL 


SPSDYKLMYF 


IIEWKNLNED 


GEIKWLRISS 


SVKKYYIHDH 


801 


FIPIEKYQFS 


LYP IFME GVG 


KPKIINSFTQ 


DDIEKHQSDA 


GLYVIVPVII 


851 


SSSILLLGTL 


LISHQRMKKL 


FWEDVPNPKN 


CSWAQGLNFQ 


KKRLSIFLSS 


901 


IQHQ*HWLF 


FWSLKQFQKI 


SVLIHHGKIK 


MR*CQQLWSL 


YFQQQILKRV 


951 


LFVLVTSSTV 


LTSLRLRVLR 


*PMRTKARDN 


PLLNTPR*SA 


TLNQVKLVK 



45 
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Human OB Receptor "F" PNA Sequent ( s*a. m n 0 . 4 mNnn . 





1 


CCGCCGCCAT 


' CTCTGCCTTC 


: GGTCGAGTTG 


! GACCCCCGGA 


> TCAAGGTGTA 


5 


51 


CTTCTCTGAA 


. GTAAGATGAT 


TTGTCAAAAA 


. TTCTGTGTGG 


TTTTGTTACA 




101 


TTGGGAATTT 


ATTTATGTGA 


TAACTGCGTT 


TAACTTGTCA 


TATCCAATTA 


10 


151 


CTCCTTGGAG 


ATTTAAGTTG 


TCTTGCATGC 


CACCAAATTC 


AACCTATGAC 


201 


TACTTCCTTT 


TGCCTGCTGG 


ACTCTCAAAG 


AATACTTCAA ATTCGAATGG 




251 


ACATTATGAG 


ACAGCTGTTG 


AACCTAAGTT 


TAATTCAAGT 


GGTACTCACT 


15 


301 


TTTCTAACTT 


ATCCAAAACA 


ACTTTCCACT 


GTTGCTTTCG 


GAGTGAGCAA 




351 


GATAGAAACT 


GCTCCTTATG 


TGCAGACAAC 


ATTGAAGGAA 


AGACATTTGT 


20 


401 


TTCAACAGTA 


AATTCTTTAG 


TTTTTCAACA 


AATAGATGCA 


AACTGGAACA 


451 


TACAGTGCTG 


GCTAAAAGGA 


GACTTAAAAT 


TATTCATCTG 


TTATGTGGAG 




501 


TCATTATTTA 


AGAATCTATT 


CAGGAATTAT 


AACTATAAGG 


TCCATCTTTT 


25 


551 


ATATGTTCTG 


CCTGAAGTGT 


TAGAAGATTC 


ACCTCTGGTT 


CCCCAAAAAG 




601 


GCAGTTTTCA 


GATGGTTCAC 


TGCAATTGCA 


GTGTTCATGA 


ATGTTGTGAA 


30 


651 


TGTCTTGTGC 


CTGTGCCAAC 


AGCCAAACTC 


AACGACACTC 


TCCTTATGTG 


701 


TTTGAAAATC 


ACATCTGGTG 


GAGTAATTTT 


CCAGTCACCT 


CTAATGTCAG 




751 


TTCAGCCCAT 


AAATATGGTG 


AAGCCTGATC 


CACCATTAGG 


TTTGCATATG 


35 


801 


GAAA.TCACAG 


ATGATGGTAA 


TTTAAAGATT 


TCTTGGTCCA 


GCCCACCATT 




851 


GGTACCATTT 


CCACTTCAAT 


ATCAAGTGAA 


ATATTCAGAG 


AATTCTACAA 


A C\ 

40 


901 


CAGTTATCAG 


AGAAGCTGAC 


AAGATTGTCT 


CAGCTACATC 


CCTGCTAGTA 




951 


G ACAG TAT AC 


TTCCTGGGTC 


TTCGTATGAG 


GTTCAGGTGA 


GGGGCAAGAG 




1001 


ACTGGATGGC 


CCAGGAATCT 


GGAGTGACTG 


GAGTACTCCT 


CGTGTCTTTA 


45 


1051 


CCACACAAGA 


T GT CAT ATAC 


TTTCCACCTA 


AAATTCTGAC 


AAGTGTTGGG 




1101 


TCTAATGTTT 


CTTTTCACTG 


CATCTATAAG 


AAGGAAAACA 


AGATTGTTCC 


50 


1151 


CTCAAAAGAG 


ATTGTTTGGT 


GGATGAATTT 


AGCTGAGAAA 


ATTCCTCAAA 


1201 


GCCAGTATGA 


TGTTGTGAGT 


GATCATGTTA 


GCAAAGTTAC 


TTTTTTCAAT 




1251 


CTGAATGAAA 


CCAAACCTCG 


AGGAAAGTTT 


ACCTATGATG 


CAGTGTACTG 
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1301 


CTGCAATGAA 


CATGAATGCC 


1351 


ATGTCAATAT 


CAATATCTCA 


1401 


ACTTGCAGAT 


GGTCAACCAG 


1451 


GCAATTGAGG 


TATCATAGGA 


1501 


TTCATCCCAT 


ATCTGAGCCC 


1551 


TATGAATGCA 


TTTTCCAGCC 


1601 


GATTAGGATC 


AATCACTCTC 


1651 


TCCTTCCTGA 


TTCTGTGGTG 


1701 


GAAATTACTA 


TAAACATTGG 


1751 


C T T T CC AG AG 


AATAACCTTC 


1801 


AAGAAGTACA 


ATGGAAGATG 


1851 


GTCAGTCTCC 


C AG T T C C AG A 


1901 


CTGTAAGAGG 


CTAGATGGAC 


1951 


CCTACACAGT 


TGTCATGGAT 


2001 


TGGAGAATAA 


TTAATGGAGA 


2051 


ACTTTGGAAG 


CCCCTGATGA 


2101 


ATGTGATAAA 


CCATCATACT 


2151 


GGAAATCACA 


CGAAATTCAC 


2201 


TACGGTTCTG 


GCCATCAATT 




TAACCTTTTC 


ATGGCCTATCj 


2301 


GCTTATCCTT 


TAAACAGCAG 


2351 


CAGTGATTAC 


AAGCTAATGT 


.2401 


AAGATGGTGA 


AATAAAATGG 


2451 


TATATCCATG 


AT CAT TT TAT 


2501 


CCCAATATTT 


ATGGAAGGAG 


2551 


CTCAAGATGA 


TATTGAAAAA 



70 - 

ATCATCGCTA TGCTGAATTA TATGTGATTG 
TGTGAAACTG ATGGGTACTT AACTAAAATG 
TACAATCCAG TCACTTGCGG AAAGCACTTT 
GCAGCCTTTA CTGTTCTGAT ATTCCATCTA 
AAAGAT T GC T ATTTGCAGAG TGATGGTTTT 
AATCTTCCTA TTATCTGGCT ACACAATGTG 
TAGGTTCACT TGACTCTCCA CCAACATGTG 
AAGCCACTGC CTCCATCCAG TGTGAAAGCA 
ATT AT TG AAA ATATCTTGGG AAAAGCCAGT 
AATTCCAGAT TCGCTATGGT TTAAGTGGAA 
TATGAGGTTT ATGATGCAAA ATCAAAATCT 
CTTGTGTGCA GTCTATGCTG TTCAGGTGCG 
TGGGATATTG GAGTAA.TTGG AGCAATCCAG 
ATAAAAGTTC CTATGAGAGG ACCTGAATTT 
TACTATGAAA AAGGAGAAAA ATGTCACTTT 
AAAATGACTC ATTGTGCAGT GTTCAGAGAT 
TCCTGCAATG GAACATGGTC AGAAGATGTG 
TTTCCTGTGG ACAGAGCAAG CACATACTGT 
CAATTGGTGC TTCTGTTGCA AATTTTAATT 
AGCAAAGTAA ATATCGTGCA GTCACTCAGT 
TTGTGTGATT GTTTCCTGGA TACTATCACC 
ATTTTATTAT TGAGTGGAAA AATCTTAATG 
CTTAGAATCT CTTCATCTGT TAAGAAGTAT 
CCCCATTGAG AAGTACCAGT TCAGTCTTTA 
TGGGAAAACC AAAGAT AATT AATAGTTTCA 
CACCAGAGTG ATGCAGGTTT ATATGTAATT 



WO 97/25424 



10 



2601 


GTGCCAGTAA 


TTATTTCCTC 


2651 


ATCACACCAA 


AGAATGAAAA 


2701 


AGAATTGTTC 


CTGGGCACAA 


2751 


ATC T TT T TAT 


CAACi CA T ACA 


2801 


GAGCCTGAAA 


CAATTTCAGA 


2851 


TAAAGATGAG 


ATGATGCCAA 


2901 


ATCTTGAAAA 


GGGTTCTGTT 


2951 


TTCTCTGAGG 


CTGAGGGTAC 


3001 


ACAACCCTTT 


GTTAAATACG 


3051 


AAACTGGTGA 


AGA 



PCT/US97/00128 
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TTCCATCTTA TTGCTTGGAA CATTATTAAT 
AGCTATTTTG GGAAGATGTT CCGAACCCCA 
GGACTTAATT TTCAGAAGAA ACGTTTGAGC 
GCATCAGTGA CATGTGGTCC TCTTCTTTTG 
AGATATCAGT GTTGATACAT CATGGAAAAA 
CAACTGTGGT CTCTCTACTT TCAACAACAG 
TGTTTTAGTG ACCAGTTCAA CAGTGTTAAC 
TGAGGTAACC TATGAGGACG AAAGCCAGAG 
CCACGCTGAT CAGCAACTCT AAACCAAGTG 
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Human OP Rscentor "C" Amino Acid Sequence (Seq . ID No. 5 (Amino 



10 



15 



20 



25 



30 



35 



40 



1 


MICQKFCWL 


LHWEFIYVIT 


AFNLSYPITP 


WRFKLSCMPP 


NSTYDYFLLP 


51 


AGLSKNTSNS 


NGHYETAVEP 


KFNSSGTHFS 


NLSKTTFHCC 


FRSEQDRNCS 


101 


LCADNIEGKT 


FVSTVNSLVF 


QQIDANWNIQ 


CWLKGDLKLF 


ICYVESLFKN 


151 


LFRNYNYKVH 


LLYVLPEVLE 


DSPLVPQKGS 


FQMVHCNCSV 


HECCECLVPV 


201 


PTAKLNDTLL 


MCLKITSGGV 


IFQSPLMSVQ 


PINMVKPDPP 


LGLHME I TDD 


251 


GNLKISWSSP 


PLVPFPLQYQ 


VKYSENSTTV 


IREADKIVSA 


TSLLVDSILP 


301 


GSSYEVQVRG 


KRLDGPGIWS 


DWSTPRVFTT 


QDVIYFPPKI 


LTSVGSNVSF 


351 


HCIYKKENKI 


VPSKEIVWWM 


NLAEKIPQSQ 


YDWSDHVSK 


VTFFNLNETK 


401 


PRGKFTYDAV 


YCCNEHE CHH 


RYAELYVIDV 


NINISCETDG 


YLTKMTCRWS 


451 


TSTIQSLAES 


TLQLRYHRSS 


LYCSDIPSIH 


PISEPKDCYL 


QSDGFYECIF 


501 


QPIFLLSGYT 


MWIRINHSLG 


SLDSPPTCVL 


PDSWKPLPP 


SSVKAEITIN 


551 


IGLLKISWEK 


PVFPENNLQF 


QIRYGLSGKE 


VQWKMYEVYD 


AKSKSVSLPV 


601 


PDLCAVYAVQ 


VRCKRLDGLG 


YWSNWSNPAY 


TWMDIKVPM 


RGPEFWRIIN 


651 


GDTMKKEKNV 


TLLWKPLMKN 


DSLCSVQRYV 


INHHTSCNGT 


WSEDVGNHTK 


701 


FTFLWTEQAH 


TVTVLAINSI 


GASVANFNLT 


FSWPMSKVNI 


VQSLSAYPLN 


751 


'SSCVIVSWIL 


SPSDYKLMYF 


I IEWKNLNED 


GEIKWLRISS 


SVKKYYIHDH 


801 


FIPIEKYQFS 


LYPIFMEGVG 


KPKIINSFTQ 


DDIEKHQSDA 


GLYVIVPVII 


851 


SSSILLLGTL 


LISHQRMKKL 


FWEDVPNPKN 


CSWAQGLNFQ 


KMLEGSMFVK 


901 


SHHHSLISST 


QGHKHCGRPQ 


GPLHRKTRDL 


CSLVYLLTLP 


PLLSYDPAKS 


951 


PSVRNTQE*S 


IKKKKKKLEG 
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Human OB Receptor "C" DNA Sequence (Sea. ID No. fi fDMzn i . 



10 



15 



20 



25 



30 



35 



40 



45 



50 



1 


CCGCCGCCAT 


CTCTGCCTTC 


GGTCGAGTTG 


GACCCCCGGA 


TCAAGGTGTA 


51 


CTTCTCTGAA 


GTAAGATGAT 


TTGTCAAAAA 


TTCTGTGTGG 


TTTTGTTACA 


101 


TTGGGAATTT 


ATTTATGTGA 


TAACTGCGTT 


TAACTTGTCA 


TATCCAATTA 


151 


CTCCTTGGAG 


ATTTAAGTTG 


TCTTGCATGC 


CACCAAATTC 


AACCTATGAC 


201 


TACTTCCTTT 


TGCCTGCTGG 


ACTCTCAAAG 


AATACTTCAA 


ATTCGAATGG 


251 


ACAT TAT GAG 


ACAGCTGTTG 


AACCTAAGTT 


TAATTCAAGT 


GGTACTCACT 


301 


TTTCTAACTT 


ATCCAAAACA 


ACTTTCCACt 


GTTGCTTTCG 


GAGTGAGCAA 


351 


GATAGAAACT 


GCTCCTTATG 


TGCAGACAAC 


ATTGAAGGAA 


AGACATTTGT 


401 


TTCAACAGTA 


AATTCTTTAG 


TTTTTCAACA 


AATAGATGCA 


AACTGGAACA 


451 


TACAGTGCTG 


GCTAAAAGGA 


GACTTAAAAT 


TATTCATCTG 


TTATGTGGAG 


501 


TC AT TAT T T A 


AGAATCTATT 


CAGGAATTAT 


AACTATAAGG 


TCCATCTTTT 


551 


ATATGTTCTG 


CCTGAAGTGT 


TAGAAGATTC 


ACCTCTGGTT 


CCCCAAAAAG 


601 


GCAGTTTTCA 


GATGGTTCAC 


TGCAATTGCA 


GTGTTCATGA 


ATGT TGTGAA 


651 


TGTCTTGTGC 


CTGTGCCAAC 


AGCCAAACTC 


AACGACACTC 


TCCTTATGTG 


701 


TTTGAAAATC 


ACATCTGGTG 


GAGTAATTTT 


CCAGTCACCT 


CTAATGTCAG 


751 


TTCAGCCCAT 


AAATATGGTG 


AAGCCTGATC 


CACCATTAGG 


TTTGCATATG 


801 


GAAAT C AC AG 


ATGATGGTAA 


TTTAAAGATT 


TCTTGGTCCA 


GCCCACCATT 


DO X 




C O AC T T CAAT 


ATCAAGTGAA 


ATAT T C AGAG 


AAT T C TACAA 


901 


CAGTTATCAG 


AGAAGCTGAC 


AAGATTGTCT 


CAGCTACATC 


CCTGCTAGTA 


951 


GACAGTATAC 


TTCCTGGGTC 


TTCGTATGAG 


GTTCAGGTGA 


GGGGCAAGAG 


1001 


ACTGGATGGC 


CCAGGAATCT 


GGAGTGACTG 


GAGTACTCCT 


CGTGTCTTTA 


1051 


CCACACAAGA 


TGTCATATAC 


TTTCCACCTA 


AAATTCTGAC 


AAGTGTTGGG 


1101 


TCTAATGTTT 


CTTTTCACTG 


CATCTATAAG 


AAGGAAAACA 


AGATTGTTCC 


1151 


CTCAAAAGAG 


ATTGTTTGGT 


GGATGAATTT 


AGCTGAGAAA 


ATTCCTCAAA 


1201 


GCCAGTATGA 


TGTTGTGAGT 


GATCATGTTA 


GCAAAGTTAC 


TTTTTTCAAT 
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10 



15 



20 



25 



30 



35 



40 



45 



50 









74 — 






1251 


CTGAATGAAA 


CCAAACCTCG 


AGGAAAGTTT 


ACCTATGATG 


CAGTGTACTG 


1301 


CTGCAATGAA 


CATGAATGCC 


ATCATCGCTA 


TGCTGAATTA 


TATGTGATTG 


1351 


ATGTCAATAT 


CAATATCTCA 


TGTGAAACTG 


ATGGGTACT T 


AACTAAAATG 


1401 


ACTTGCAGAT 


GGTCAACCAG 


TACAATCCAG 


TCACTTGCGG 


AAAGCACTTT 


1451 


GCAATTGAGG 


TATCATAGGA 


GCAGCCTTTA 


CTGTTCTGAT 


ATTCCATCTA 


1501 


TTCATCCCAT 


ATCTGAGCCC 


AAAGATTGCT 


ATTTGCAGAG 


TGATGGTTTT 


1551 


TATGAATGCA 


TTTTCCAGCC 


AATCTTCCTA 


TTATCTGGCT 


ACACAATGTG 


1601 


GATTAGGATC 


AATCACTCTC 


TAGGTTCACT 


TGACTCTCCA 


CCAACATGTG 


1651 


TCCTTCCTGA 


TTCTGTGGTG 


AAGCCACTGC 


CTCCATCCAG 


TGTGAAAGCA 


1701 


GAAATTACTA 


TAAACATTGG 


ATTATTGAAA 


ATATCTTGGG 


AAAAGCCAGT 


1751 


CTTTCCAGAG 


AATAACCTTC 


AATTCCAGAT 


TCGCTATGGT 


TTAAGTGGAA 


1801 


AAGAAGTACA 


ATGGAAGATG 


TATGAGGTTT 


ATGATGCAAA 


ATCAAAATCT 


1851 


GTCAGTCTCC 


CAGTTCCAGA 


CTTGTGTGCA 


GTCTATGCTG 


TTCAGGTGCG 


1901 


CTGTAAGAGG 


CTAGATGGAC 


TGGGATATTG 


GAGTAATTGG 


AGCAATCCAG 


1951 


CCTACACAGT 


TGTCATGGAT 


ATAAAAGTTC 


CTATGAGAGG 


ACCTGAATTT 


O ft ft 1 

2001 


TGGAGAATAA 


T T AAT G GAGA 


T AC T ATG AAA 


AACj CjACjAAAA 


A X\J X (JAO ill 


2051 


ACTTTGGAAG 


CCCCTGATGA 


AAAATGACTC 


ATTGTGCAGT 


(i X 1 LAbAbAT 


2101 


ATGTGATAAA 


CCATCATACT 


TCCTGCAATG 


GAACATGGTC 


AGAAGATGTG 


2151 


GGAAATCACA 


CGAAATTCAC 


TTTCCTGTGG 


ACAGAGCAAG 


CACATACTGT 


2201 


TACGGTTCTG 


GCCATCAATT 


CAATTGGTGC 


TTCTGTTGCA 


AATTTTAATT 


2251 


TAACCTTTTC 


ATGGCCTATG 


AGCAAAG TAA 


ATATCGTGCA 


GTCACTCAGT 


2301 


GCTTATCCTT 


TAAACAGCAG 


TTGTGTGATT 


GTTTCCTGGA 


TACTATCACC 


2351 


CAGTGATTAC 


AAGCTAATGT 


ATTTTATTAT 


TGAGTGGAAA 


AATC TTAATG 


2401 


AAGATGGTGA 


AATAAAATGG 


CTTAGAATCT 


CTTCATCTGT 


TAAGAAGTAT 


2451 


TATATCCATG 


ATCATTTTAT 


CCCCATTGAG 


AAGTACCAGT 


TCAGTCTTTA 


2501 


CCCAATATTT 


ATGGAAGGAG 


TGGGAAAACC 


AAAGATAATT 


AATAGTTTCA 


2551 


CTCAAGATGA 


TATTGAAAAA 


CACCAGAGTG 


ATGCAGGTTT 


ATATGTAATT 
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CCATCTTA TTGCTTGGAA CATTATTAAT 

2601 gtgccagtaa tt^tcctc " GQAAGATGTT ccc^CCCC, 

2651 ««C« A — — ^ ttcaGAAGAT GCTTGAAGGC 

2701 ^-cc^ ^^.^-^ 

2751 ^ tctGCATAGG ^ 

290 1 GACCCTGCCA AATCCCCCTC 
> «1 A***-- — CG AGGGGG 
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51 
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10 1 51 

201 
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IS 

301 



1 MICQKFCW1 LHWEFIYVIT "^^-ULMu^L 

- — ^ z:zr Q :r Hcc f — 

MClKII r ~- 
— PWFPlQrQ j--- KMElrDD 

HCIYKKENKI VP SKEXVWWM -SV GSNVSF 

— 12Z° V ™ 1SCE ™ 

T — ™« SLG SlDSPPT ^ ™ tDCa QSDGFYECIF 
1GLLKI SWEK PVFP ENNLQF P ° SVVKPLPP 

— «-0^ rVTVMlNSI ^ RyV INf » T « NGT KSEDVGNHTK 

— K j — ~ PLN 

'™OFS LyPlFMEGVG 

— — „ 

""ASVTCGP LLLEPetise »„ CSBA <^ 0 KPETFE^FI 

G SVC ISDQFN SVNFSEAEGT ^ — 

"« LKLEG j;;; DQHPNIISPH 

— C lF TDIRVLQDSC S^T^ 
ENNCSK*KKK KKNSRPARPD ~*«* GERKETRVKF 
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20 401 
451 
501 

25 

551 
601 
30 651 
701 
751 

5 

801 
851 
> 901 
951 
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1 


GCGGCCGCCA C 


51 


GGTCGAGTTG < 


101 


TTGTCAAAAA 


151 


TAACTGCGTT 


201 


TCTTGCATGC 


251 


GCTCTCAAAG 


301 


AACCTAAGTT 


351 


ACTTTCCACT 



401 
451 
501 
25 551 



30 



601 
651 
701 
751 
35 801 
851 
901 

40 



45 



50 



TGCAGACAAC ATTGAA^ — rCTAAAAGGA 
TTTTTCAACA AA T AGA T GCA —A == ^ „ 
GACTTAAAAT TATTCATCTG TTATGTGGAG TCATTATTTA A 

AACT ATAAGG TCCATCTTTT ATATGTTCTG CCTGAAGTGT 
CAGGAATTAT AACTATAAGU r-fcTrGTTCAC 
TAGAAGATTC ACCTCTGGTT CCCCAAAAAG GCAGTTTTCA 
TGCAATTGCA GTGTTCACGA ATGTTGTGAA TGTCTTGTGC CTGTGCCAAC 
l" T C AACGACACTC TCCTTATGTG TTTGAAAATC ACATCTGGTG 
ZZ,r =CAG T CACC T CTAATGTCAG -GCCCA, ^ = 
AAGCCTGATC CACCATTAGG TTTGCATATG GAAATCACAG ATGATGGTAA 
AAGCCTGATC CA GCCCRCCRTT GGTACCATTT CCACTTCAAT 

TTTAAAGATT TCTTGGTCCA ^™ AGAAGCTGAC 
ATCAAGTGAA ATATTCAGAG ««« 
« AAGATTGTCT CAGCAC^C 
1001 TTCGTATGAG GTTCAGGTGA GGGGCAA ^ 

105 1 GGAGTGACTG GAGTACTCCT CGTGTCTTTA CC CTTTTCACTG 

ll01 TTT CCACC,A AAATTCTGAC AAGTGTTGGG TCTA*^ ~ 

US1 CATCTATAAG AAGGAAAACA AGATTGTTCC A, 

1201 GGATGAATTT AGCTGAGAAA ATTCCTCAAA GCCAGTATGA T 

1251 GATCATGTTA GCAAAGTTAC TTTTTTCAAT CTGAATGAAA CCAAACCTCG 
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1301 AGGAAAGTTT ACCTATGATG CAGTGTACTG CTGCAATGAA CATGAATGCC 
a 1351 ATCATCGCTA TGCTGAATTA TATGTGATTG ATGTCAATAT CAATATCTCA 
1401 TGTGAAACTG ATGGGTACTT AACTAAAATG ACTTGCAGAT GGTCAACCAG 
1431 TACAAICCAG TCACTTGCGG AAAGCACTTT GCAATTGAGG TATCATAGGA 
10 1501 GCAGCCTTIA CTGTTCTGAT ATTCCATCTA TTCATCCCAT AICTGAGCCC 
1551 AAAGATTGCT ATTTGCAGAG TGATGGTTTT IATGAATGCA TTTTCCAGCC 
>. is 1601 AATCITCCTA TTATCTGGCT ACACAATGTG GATTAGGATC AATCACTCTC 
1651 IAGGITCACT IGACTCTCCA CCAACATGTG TCCTICCIGA TTCTGTGGTG 
1701 AAGCCACTGC CTCCATCCAG TGTGAAAGCA GAAATTACTA TAAACATTGG 
20 1,51 ATTATIGAAA ATATCTTGGG AAAAGCCAGI CTTTCCAGAG AATAACCTTC 
1801 AATTCCAGAT TCGCTATGGT T TAAGTGGAA AAGAAGTACA ATGGAAGATG 
25 1851 TATGAGGTTT ATGATGCAAA ATCAAAATCT GTCAGTCTCC CAGTTCCAGA 
1901 CTTGTGTGCA GTCTATGCIG TTCAGGTGCG CTGTAAGAGG CTAGATGGAC 
155! TGGGATATTG GAGTAATTGG AGCAATCCAG CCTACACAGT TGTCATGGAT 
30 2001 ATAAAAGTTC CTATGAGAGG ACCTGAATTT TGGAGAATAA TTAATGGAGA 
2051 TACTATGAAA AAGGAGAAAA ATGTCACTTT ACITTGGAAG CGCCTGATGA 
3s 2101 AAAATGACTC ATTGTGCAGT GTTCAGAGAT ATGTGATAAA CCATCATACT 
f 2151 TCCTGCAATG GAACATGGTC AGAAGATGTG GGAAATCACA CGAAATTCAC 
2201 TTTCCTGTGG ACAGAGCAAG CACATACTGT TACGGTTCTG GCCATCAATT 
40 2251 CAATTGGTGC TTCTGTTGCA AATTTTAATT TAACCTTTTC ATGGCCTATG 
2301 AGCAAAGTAA ATATCGTGCA GTCACTCAGT GCTTATCCTT TAAACAGCAG 
45 2351 TTGTGTGATT GTTTCCTGGA TACTATCACC CAGTGATTAC AAGCTAATGT 
2401 ATTTTATTAT TGAGTGGAAA AATCTTAATG AAGATGGTGA AATAAAATGG 
2451 CTTAGAATCT CTTCATCTGT TAAGAAGTAT TATATCCATG ATCATTTTAT 
50 2501 CCCCATTGAG AAGTAGCAGT TCAGTCTX T A CCCAATATTT ATGGAAGGAG 
2551 TGGGAAAACC AAAGATAATT AATAGTTTCA CTCAAGATGA TATTGAAAAA 
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2601 CACCAGAGTG ATGCAGGTTT ATATGTAATT GTGCCAGTAA TTATTTCCTC 
265 1 TTCCATCTTA TTGCTTGGAA CATTATTAAT ATCACACCAA AGAATGAAAA 
5 2701 AGCTATTTTG GGAAGATGTT CCGAACCCCA AGAATTGTTC CTGGGCACAA 
275 1 GGACTTAATT TTCAGAAGCC AGAAACGTTT GAGCAICITT TTATCAAGCA 
280 1 TACAGCATCA GTGACATGTG GTCCTCTTCT TTTGGAGCCT GAAACAATTT 
■° 28 51 CAGAAGATAT CAGTGTTGAT ACATCATGGA AAAATAAAGA TGAGAT6ATG 
290 1 CCAACAACTG TGGTCTCTCT ACTTTCAACA ACAGATCTTG AAAAGGGTTC 
15 2951 TGTTTGTATT AGTGACCAGT TCAACAGTGT TAACTTCTCT GAGGCTGAGG 
3001 GTACTGAGGT AACCTATGAG GACGAAAGCC AGAGACAACC CTTTGTTAAA 
30S1 TACGCCACGC TGATCAGCAA CTCTAAACCA AGTGAAACTG GTGAAGAACA 
20 3101 AGGGCTTATA AATAGTTCAG TCACCAAGTG CTTCTCTAGC AAAAATTCTC 
3151 CGTTGAAGGA TTCTTTCTCT AATAGCTCAT GGGAGATAGA GGCCCAGGCA 
25 320 1 TTTTTTATAT TATCGGATCA GCATCCCAAC ATAATTTCAC CACACCTCAC 
325 1 ATTCTCAGAA GGATTGGATG AACTTTTGAA ATTGGAGGGA AATTTCCCTG 
3301 AAGAAAATAA TGATAAAAAG TCTATCTATT ATTTAGGGGT CACCTCAATC 
30 3351 AAAAAGAGAG AGAGTGGTGT GCTTTTGACT GACAAGTCAA GGGTATCGTG 
3401 CCCATTCCCA GCCCCCTGTT TATTCACGGA CATCAGAGTT CTCCAGGACA 
35 3451 GTTGCTCACA CTTTGTAGAA AATAATATCA ACTTAGGAAC TTCTAGTAAG 
3501 AAG ACT TTTG CATCTTACAT GCCTCAATTC CAAACTTGTT CTACTCAGAC 
3551 TCATAAGATC ATGGAAAACA AGATGTGTGA CCTAACTGTG TAATCTAGA 
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Human OP Receptor Protein -r>" rhrnmnsnmai dna T n no gi 



Intron 1 



-taccttttccag 



GTG TAC TTC 



CAT TGG G gtaagttatttg. 
His Trp Glu 
12 13 14 



Intron 2 



. atatcctaacag AA TTT ATT 

Phe lie 
15 16 



CM ATA G gtaagcattagc . 
Gin lie Asp 
122 123 124 



Intron 3 



.ttttaaattcag 



AT GCA AAC 
Ala Asn 
125 126 



TAT GTT CT gtaagtaccaaa . 
Tyr Val Leu 
163 164 165 



Intron 4 



. ttttcaatatag 



CCT GAA 
Pro Glu 
166 167 



AAT ATG G gtaagttatgca . 
Asn Met Val 
233 234 235 



Intron 5 



. tttttccttaag 



TG AAG CCT 
Lys Pro 
236 237 



ATC AGA GAA gtaagtatattt . 
He Arg Glu 
281 282 283 



Intron 6 



. aatatttaacag 



GCT GAC AAG 
Ala Asp Lys 
284 285 286 



ACA CAA G gtaggttatgta . 
Thr Gin Asp 
330 331 332 



Intron 7 



. ccct cattacag 



AT GTC ATA 
Val He 
333 334 



GTG ATT G gtaagaaaacag . 
Val He Asp 
427 428 429 



Intron 8 



. tgtttcaaatag 



AT GTC AAT 
Val Asn 
430 431 



TAT CAT AG gtacgtattatt - 
Tyr His Arg 
466 467 468 



Intron 9 



.tatcttttaaag 



AGC AGC 
Ser Ser 
469 470 



TCT GTG G gtatgtcaagct . 
Ser Val Val 
533 534 535 



Intron 10 



.aaaaatttctag 



TG AAG CCA 
Lys Pro 
536 537 



CAA TGG AAG gtaccttttact . 
Gin Trp Lys 
582 583 584 



Intron 11 



- cttattttacag 



ATG TAT GAG 
Met Tyr Glu 
585 586 587 



ATA AAA G gtctgcagagat 
He Lys Val 
636 637 638 



Intron 12 gtcattttgcag 



TT CCT ATG 
Pro Met 
639 640 
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5 1 
51 

101 

10 151 

201 

251 

301 

351 

2 ° 401 



ob «— IF — ° «^ 

nr: — ^::r-- 
f — -rr Dst — ^r;r— 
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-«« SFTQ ddierh ;; d ss 
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502 
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801 
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5 1 
51 
101 

10 151 



15 



20 



25 



201 


TCTTGCATGC G 


251 


GCTCTCAAAG » 


301 


AACCTAAGTT 1 


351 


ACTTTCCACT 


401 


TGCAGACAAC 


451 


tttttcaaca 


501 





30 



551 
601 
651 
701 
35 "751 
801 
851 
901 
951 



40 



45 



50 



IiG AAGAT TC ACCTCTGGTT XGTCTTGTGC C.G.GCCAAC 

TGCAATTGCA GTGTTCACGA rCatctgCT G 

MC CAAACTC AACGACACTC TCCT^ ^ 

^aa^ ccagxcacc, g ATGRTGGTAR 

AAGCCTGATC ^ QGTACCATTT CCACXTCAA* 

TTTAAAGATT TCTTGG1CCA «=«^ AGAAGCTGAC 

ATCAAGTGAA ATATTCAGAG »«« ^ 

» — - CAGCTACAK AC.GGA.GGC CCACGAA.C* 
1001 TTCGTA.GAG GTTCAGGTGA G-- 

1051 GGAGTGACTG GAG.AC, ^ ^ CT „ T CAC T G 

u01 TW CACC T A AAATTCTGAC ^ ATTGTTTGGT 

- ^ --AAA GCCAGTATGA 

120 1 GGATGAATTT AGCTGAU 
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1401 
10 14 51 

1501 
1551 

15 

1601 



1251 e ** e ***** XXITTTCAiT 

" 0l ACCTATGATG CAG^G CO 

™ — ^GCTGAATTA ^^^^ — 
" ^GTGAAACTG ATGGGTACTT AAC 

TACAATCCAG -AC^CGG — 
GCAGCCTTTA C T G TT C T GA T AT^CAXC™ ^ 
— ATTTGCAGAG 

AATCTTCCTA ITATCTGGCT AC*,, ^ATGAATGCA TTTTCCAGCC 

U« — C T TGACTCTCCA 
20 "0! ct « CCSA ^G TCCTXCCTGA TTCTGTGGTG 

— CCAGA. TCGCTATGGT ^^A 
1"ATGAGGTTT ATGATGCAAA ATC^ 

CTTGTGTGCA C^GC^ G TCAGTCTCC CAGTTCCAGA 

TGGGATAXTG GAGTAATTGG A^ CTS ™= CTAGATGGAC 

*«» TACTATGAAA AAGGAGAAAA A^ T 

— xcacxc ax^gcaT gTZ " ACTTTGGAAG CC «— 

«« TCCTGCAATG GAACAXGGTC A j CCATCATACT 
«1 -CC.G.GG ACAGAG^g ^ 

»« CAATTGGTGC TTCTGTT ^ — "G GCCA.CAA^ 

AGCAAAGTAA A.A^ — 

»* TTGTGTGATT G TTTCCT( ^ ^ ^ T ^^ t - rr TAAACAGCAG 



1801 
1851 
1901 
3 0 1951 

35 

40 

45 

2401 
50 2451 
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, &AAGATAATT Lvrt** CTCAAGATGA TATTGAAAAA 
2 551 TGGGAAAACC AAAGATAAT1 
2601 CACCAGAGTG ATTGATAAGG ATCC 
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51 

10 101 
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201 
251 
301 
351 



15 
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25 



30 



35 



401 
451 
501 
551 
601 
651 
701 
751 
801 



40 851 



45 



901 
951 
1001 
1051 



50 Hoi 
1151 



CCATTGAAGT CAATGGGAGr Trcm^GC^T^' 

TGTACGGTGG GAGGTCTATA TAAGCAGAGC rCG^ "T^" 
TCTCTAGAAG CTGGGTACCA GCTGCIAGCA ^ 

TGTGATGGAT ATCTGCAPa a AGCTTGCTAG CGGCCGCCAG 

ATCTGCAGAA TTCGGCTTTC TCTrrvww 
ACCCCCGGAr CAAGGTGTAC TTCTCTGAAG ™TTGG 
TCTGTGTGGT TTTGTTACAT " 
AACTTGTCAT ATCCAATTAC TCCTTGGAGA 

ACCAAATTCA ACCTATGACT ACTTCCTTTT CTT <**TGCC 
ATACTTCAAA TTCGAATGGA Z^T^ 
AATTCAAGTG GTACTCACTT T^CTAAC^ " 
TTGCTTTCGG AGTGAGC* TCC *^CAA CTTTCCACTG 

ATAGATGCAA ACTGGAa.1 
ATTCATCTGT TATGTGGAGT 

ac TAT aag CT c :;; c ;^ t z m - m ~ 

10 CC CAAAAAGG CAGTTTTrar 
TGTTCACGAA TGTTGTGAa T , A ™="CACT GCAATTGCAG 

-gacacxc, cc;;;c: ;r TGcc TTOa 

— C TAATGTCAGT ^ ""^ 

z^ir: ~ t ctgatcc 

CTTGGTCCAG CCCACC*™,- ^GATCGTAAT TTAAAGATTT 

CCC ACCATTG GTACCATTTC CACTTrisnTi 
TATTCAGAGA ATTCTACAAC zr, ^"^A TCAAGTGAAA 

CC CTGCTAGTAG ACAGTATACT TCCTGGGTCT TCGTATGAGG 
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CTGGATGGCC CAGGAATCTG GAGTGACTGG 

1201 ttcaggtgag gggcaagaga ttccacctaa 

r-^rrmC GTGTCTTTAC CACACAAGAi 
1251 MT " AGTGTTGGGT CO***™ TTTTCACTGC ATCTATAAGA 

1301 ««™ • 

^L* CCAGTATGAT GTTGTGAGTG ATCATGTTAG 
GCTGAGAAAA TTCCTCAAAG „, ArcTC GA GGAAAGTTTA 

.^AGTTACT TTTTTCAATC TGAATGAAAC CAAACCTCGA GGAA* 

T GC AGTGTACTGC TGCAATGAAC ATGAATGCCA TCATCGCTAT 
CCTATGATGC GTG AAACTGA 
GCTGAATTAT A T G~ 

^ "I ^^G ^GAGG, ATCATAGGAG CAGCC^AC 
1651 CACTTGCGGA AAG-C^ ^ ^CA 

110 1 TGTTCTGATA TTCCATCTAT „ TCCAGCC A ATCTTCCTAT 

1751 TTTGCAGAGT GATGGTTTTT ATGAATGCAT TTTCCA 

riatrrGG ATTAGGATCA ATCACTCTCT AGGTTCACTT 
l801 TATCTGGCTA CACAATGTGG AT ^ 

1851 GACTCCCAC CAACATGXG, C^C- 

190 1 TCCATCCAGT GTGAAAGCAG AAATTACTAT ATTCCAGATT 

.^rrrRGTC TTTCCAGAGA ATAACCTTCA *i 
1951 XATCTTGGGA AAAGCCAGTC TGGAAGATGT ATGAGGTTTA 

2001 CGCTATGGTT TAAGTGGAAA AGAAGTACAA ^ 

" I" —C — , GGGATATTGG 
2101 TCTATGCTGT TCAGGTGCGC 

2151 l" — , .AATGGAGA, ACTATGAAAA 

2201 TATGAGAGGA CCT^ 

225 1 AGGAGAAAAA TGTCACTTTA f-ATCATACTT CCTGCAATGG 

2301 TTGTGCAGTG TTCAGAGATA TGTGATAAAC C^A™ 
_ „CA GAAGA^ — 

2401 " ^ 
:~ :r™ c TOT — 



1401 
1451 
1501 
15 1551 



20 



25 1801 



30 



35 2051 



40 
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2551 


TTTCCTGGAT 


ACTATCACCC 


AGTGATTACA AGCTAATGTA 


TTTTATTATT 


2601 


GAGTGGAAAA 


ATCTTAATGA 


AGATGGTGAA ATAAAATGGC 


TTAGAATCTC 


2651 


TTCATCTGTT AAGAAGTATT 


ATATCCATGA TCATTTTATC 


CCCATTGAGA 


2701 


AGTACCAGTT 


CAGTCTTTAC 


CCAATATTTA TGGAAGGAGT 


GGGAAAACCA 


2751 


AAGATAATTA 


ATAGTTTCAC 


TCAAGATGAT ATTGAAAAAC 


ACCAGAGTGA 


2801 


TGCAGGTGAC 


TACAAGGACG 


ACGATGACAA GTAGGGATCC 


AGACATGATA 


2851 


AGATACATTG 


ATGAGTTTGG 


ACAACCCACA ACTAGAATGC 


AGTGAAAAAA 


2901 


ATGCTTTATT 


TGTGAAATTT 


GTGATGCTAT TGCTTTATTT 


GTAACCAT 
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MICQKFCWL. 
AGLSKNTSNS 
LCADNIEGKT 

GNLKISWSSP 



30 



35 



GSSYEVQVRG 
HCIYKKENKI 
PRGKFTYDAV 

rpsTIQSLAES 
QPIFLLSGYT 
IGLLKISWEK 
PDLCAVYAVQ 
GDTMKKEKNV 
F TFLWTEQAH 
SSCVIVSWIL 

FTIL 



LHWEFIYVIT 
UGHYETAVEP 
FVSTVNSLVF 
LLYVLPEVLE 
MCLKITSGGV 

PLVPFPLQYQ 
KRLDGPGIWS 
VPSKEIVWWM 
YCCNEHECHH 
TLQLR^HRSS 
MWIRINHSLG 

PVFPENNLQF 
VRCKRLDGLG 
TLLWKPLMKN 

TVTVLAIN SI 
SPSDYKLMYF 



AFULSYPITP 
KFNSSGTHFS 
qqIDANWNIQ 
DSPLVPQKGS 
IFQSP1-MSVQ 
VKYSENSTTV 
DWSTPRVFTT 

NLAEKIPQSQ 
RYAELYVIDV 

LYCSDIPSIH 
SLDSPPTCVL 
QIRYGLSGKE 
YWSNVISNPAY 
DSLCSVQRYV 
GASVANF^LT 
IIEWKN^NED 



tfLSKTTFHCC 
CWLKGDLKLF 
FQMVHCNCSV 

PINMVKPDPP 
IREADKIVSA 

QDVIYFPPKI 

ydvvsdhvsk 

NINISCETDG 
PISEPKDCYL 
PDSWKPLPP 
VQWKMYEVYD 
YYVMDIKVPM 
INHHTSCNGT 
FSWPMSKVNI 
GEIKWLRISS 



FRSEQDRNCS 
ICYVESLFKN 
HECCECLVPV 
LGLHMEITDD 
TSLLVDSILP 
LTSVGSNVSF 
VTFFNLNETK 
YLTKMTCRWS 
QSDGFYECIF 
SSVKAEITIN 
AKSKSVSLPV 

RGPEFWRUN 
WSEDVGNHTK 

VQSLSAYPLN 
SVKKYYIHGK 
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GCGGCCGCCA GTGTGATGGA TATCTGC^a 

GGTCGAGTTG GACCCCCGGA Tr ^ CTCTGCC TTC 

CCCCGGA TCAAGGTGTA CTTCTCTra a „ 
—CAAAAA XTCXGTGTGG TXXXGXXACA XXGGGmT 
TAACTGCGXX TAACTTGTCA TATCCAATTA CXC^ 

CACCAAATTC AACCTATGAC XACXX^X ^TAAGXXG 
GCTCTCAAAG AAXACXXCAA ATTCGAATGG 1, 
— TAATTCAAGT GGXACX^ 

ACTTTCCACT GTTGCTTTCr r*n TTTCTAACTT ATCCAAAACA 

A_r ^ ^ GCTCCTTATG 
TXXXTCAACA AAXAGATGCA AACTGGA^ AATTCTTTAG 
— AAAA, TAXXCAXCXG X^T .« 
CAGGAAIXAX AACTATAAGG rc^Z ****** 
"1 TAGAAGATTC ACCXCXGGXX CCCC^T 

- -CAAACXC AACGACACXC X^ 
GAGXAAXXXX CCAGXCACCX CXAAXGXCAG 
--CCXGAXC CACCAXXAGG XXXGCAXAXG GA^ 

TXTAAAGAXX XCXXGGXCCA GCCCAC, ' AT0M <*»» 

AXCAAGXGAA AXAXXCAGAG 

— ™~T CAGCXACAXC CCXGcx^ ^ 
TTCGXAXGAG GXXCAGGXGA ~ ^™XAC TXCCXGGGXC 

--cxGGAGXAc^err:;:^^™ 

TTTCCACCTA AAATTCTGAr a C ^CACAAGA TGTCATATAC 

~~~ «— ^rr :™ 



751 

35 

801 
851 
10 901 
951 
1001 

5 

1051 



1101 
1151 
1201 
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1251 


GATCATGTTA 


GCAAAGTTAC 


TTTTTTCAAT 


CTGAATGAAA 


CCAAACCTCG 


1301 


AGGAAAGTTT 


ACCTATGATG 


CAGTGTACTG 


CTGCAATGAA 


CATGAATGCC 


1351 


ATCATCGCTA 


TGCTGAATTA 


TATGTGATTG 


ATGTCAATAT 


CAATATCTCA 


1401 


TGTGAAACTG 


ATGGGTACTT 


AACTAAAATG 


ACTTGCAGAT 


GGTCAACCAG 


1451 


TACAATCCAG 


TCACTTGCGG 


AAAGCACTTT 


GCAATTGAGG 


TATCATAGGA 


1501 


GCAGCCTTTA 


CTGTTCTGAT 


ATTCCATCTA 


TTCATCCCAT 


ATCTGAGCCC 


1551 


AAAGATTGCT 


ATTTGCAGAG 


TGATGGTTTT 


TATGAATGCA 


TTTTCCAGCC 


1601 


AATCTTCCTA 


TTATCTGGCT 


ACACAATGTG 


GATTAGGATC 


AATCACTCTC 


1651 


TAGGTTCACT 


TGACTCTCCA 


CCAACATGTG 


TCCTTCCTGA 


TTCTGTGGTG 


1701 


AAGCCACTGC 


CTCCATCCAG 


TGTGAAAGCA 


GAAATTACTA 


TAAACATTGG 


1751 


ATTATTGAAA 


ATATCTTGGG 


AAAAGCCAGT .CTTTCCAGAG 


AATAACCTTC 


1801 


AATTCCAGAT 


TCGCTATGGT 


TTAAGTGGAA 


AAGAAGTACA 


m mf /"^ ^v tv, tv m /"» 

ATGG AAG AT G 


1851 


TATGAGGT.TT 


ATGATGCAAA 


ATCAAAATCT 


GTCAGTCTCC 


CAG T TCC AGA 


1901 


CTTGTGTGCA 


GTCTATGCTG 


TTCAGGTGCG 


CTGTAAGAGG 


CTAGATGGAC 


1951 


TGGGATATTG 


GAGTAATTGG 


AGCAATC CAG 


CCTACACAGT 


TGTCATGGAT 


2001 


ATAAAAGTTC 


CTATGAGAGG 


ACCTGAATTT 


TGGAGAATAA 


T T AA T Cj Ci AL? A 


2051 


TACT AT G AAA 


AAGGAGAAAA 


ATGTCACTTT 


ACTTTGGAAG 


LLtL lCiAlCjA 


2101 


AAAATGACTC 


ATTGTGCAGT 


GTTCAGAGAT 


ATGTGATAAA 


CCATCATAL I 


2151 


TCCTGCAATG 


GAACATGGTC 


AGAAGATGTG 


GGAAATCACA 


CGAAATTCAC 


2201 


TTTCCTGTGG 


ACAGAGCAAG 


C AC AT AC T G T 


TACGGTTCTG 


GCCATCAATT 


2251 


CAATTGGTGC 


TTCTGTTGCA 


AATTTTAATT 


TAACCTTTTC 


nlooLL inlo 


2301 


AGCAAAGTAA 


ATATCGTGCA 


GTCACTCAGT 


GCTTATCCTT 


TAAACAGCAG 


2351 


TTGTGTGATT 


GTTTCCTGGA 


TACTATCACC 


CAGTGATTAC 


AAGCTAATGT 


2401 


ATTTTATTAT 


TGAGTGGAAA 


AATCTTAATG 


AAGATGGTGA 


AATAAAATGG 


2451 


CTTAGAATCT 


CTTCATCTGT 


T AAGAAG TAT 


TATATCCATG 


GTAAGTTTAC 


2501 


TATACTT 
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While the present invention has been described 
in terms of preferred embodiments/ it is understood that 
variations and modifications will occur to those skilled 
in the art. Therefore, it is intended that the appended 
5 claims cover all such equivalent variations which come 
within the scope of the invention as claimed. 
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SEQUENCE LISTING 



5 (1) GENERAL INFORMATION: 

(i) APPLICANT: CHANG, MING-SHI 

WELCHER, ANDREW A. 
FLETCHER, FREDERICK A. 

10 

(ii) TITLE OF INVENTION: OB PROTEIN RECEPTOR AND RELATED 
COMPOSITIONS AND METHODS 

(iii) NUMBER OF SEQUENCES: 33 

15 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE : Amgen Inc. 

<B) STREET: 1840 Dehavilland Drive 
(C) CITY: Thousand Oaks 
20 (D) STATE: California 

(E) COUNTRY: USA 

(F) ZIP: 91320 

(v) COMPUTER READABLE FORM: 
25 (A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patent In Release #1.0, Version #1.30 

30 (vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C ) CLAS S IF ICAT ION : 

35 (viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Pessin, Karol M. 
<C) REFERENCE /DOCKET NUMBER: A-382-A 



40 (2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 965 amino acids 

(B) TYPE: amino acid 

45 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



50 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Met lie Cys Gin Lys Phe Cys Val Val Leu Leu His Trp Glu Phe lie 
5 15 10 15 

Tyr Val lie Thr Ala Phe Asn Leu Ser Tyr Pro lie Thr Pro Trp Arg 
20 25 30 

10 Phe Lys Leu Ser Cys Met Pro Pro Asn Ser Thr Tyr Asp Tyr Phe Leu 

35 40 45 

Leu Pro Ala Gly Leu Ser Lys Asn Thr Ser Asn Ser Asn Gly His Tyr 
50 55 60 

Glu Thr Ala Val Glu Pro Lys Phe Asn Ser Ser Gly Thr His Phe Ser 
65 70 75 80 



15 



Asn Leu Ser Lys Thr Thr Phe His Cys Cys Phe Arg Ser Glu Gin Asp 
20 85 90 95 

Arg Asn Cys Ser Leu Cys Ala Asp Asn lie Glu Gly Lys Thr Phe Val 
100 105 110 

25 Ser Thr Val Asn Ser Leu Val Phe Gin Gin lie Asp Ala Asn Trp Asn 

115 120 125 

lie Gin Cys Trp Leu Lys Gly Asp Leu Lys Leu Phe lie Cys Tyr Val 
130 135 140 



30 



45 



Glu Ser Leu Phe Lys Asn Leu Phe Arg Asn Tyr Asn Tyr Lys Val His 
145 150 155 160 



Leu Leu Tyr Val Leu Pro Glu Val Leu Glu Asp Ser Pro Leu Val Pro 

35 165 170 175 

Gin Lys Gly Ser Phe Gin Met Val His Cys Asn Cys Ser Val His Glu 

180 185 190 

40 Cys Cys Glu Cys Leu Val Pro Val Pro Thr Ala Lys Leu Asn Asp Thr 

195 200 205 



Leu Leu Met Cys Leu Lys lie Thr Ser Gly Gly Val lie Phe Gin Ser 
210 215 220 

Pro Leu Met Ser Val Gin Pro lie Asn Met Val Lys Pro Asp Pro Pro 

225 230 235 240 



Leu Gly Leu His Met Glu lie Thr Asp Asp Gly Asn Leu Lys lie Ser 
50 245 250 255 

Trp Ser Ser Pro Pro Leu Val Pro Phe Pro Leu Gin Tyr Gin Val Lys 
260 265 270 
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Tyr Ser Glu Asn Ser Thr Thr Val He Arg Glu Ala A S p Lys H e Val 

280 285 

Ser Ala Thr Ser Leu Leu Val A 3 p Ser He Leu Pro Gly Ser Ser Tyr 
290 295 • 300 

Glu Val Gin Val Arg Gly Lys Arg Leu Asp Gly Pro Gly He Trp Ser 

310 315 320 

Asp Trp Ser Thr Pro Arg Val Phe Thr Thr Gin Asp Val He 

335 

Pro Pro Lys lie Leu Thr Ser Val Gly Ser Asn Val Ser Phe His Cys 
340 345 3 50 

He Tyr Lys Lys Glu Asn Lys He val Pro Ser Lys Glu He Val Trp 
355 360 365 * 

Trp Met Asn Leu Ala Glu Lys H e p ro Gin Ser Gin Tyr Asp Val Val 
J '° 375 3 8 o 

Ser Asp His Val Ser Lys Val Thr Phe Phe Asn Leu Asn Glu Thr Lys 

390 395 400 



;;r "- y vax rne ™r em Asp Val He Tyr Phe 

325 33 0 



Pro Arg Gly Lys Phe Thr Tyr Asp Ala 



405 410 



Val Tyr Cys Cys Asn Glu His 

415 



Glu Cys His His Arg Tyr Ala Glu Leu Tyr Val He Asp Val Asn lie 
420 425 430 

Asn He Ser Cys Glu Thr Asp Gly Tyr Leu Thr Lys Met Thr Cys Arg 



440 445 
Thr Ser Thr He Gin Ser Leu Ala ci„ s., 

450 



Trp Ser Thr Ser Thr He Gin Ser Leu Ala Glu Ser Thr Leu Gin Leu 

455 460 



Arg Tyr His Arg Ser Ser Leu Tyr Cys Ser Asp. He Pro Ser He His 

470 475 480 

Pro lie ser Glu Pio Lys Asp Cys Tyr Leu Gin Ser Asp Gly Phe Tyr 
485 490 495 

Glu Cys He Phe Gin Pro He Phe Leu Leu Ser Gly Tyr Th 
500 505 510 

He Arg He Asn His Ser Leu Gly Ser Leu Asp Ser Pro Pro Thr Cys 



r Met Trp 



520 525 

Lys Pro Leu Pro Pro 
535 540 



Val Leu Pro Asp Ser Val Val Lys Pro Leu Pro Pro Ser Ser Val Lys 
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Ala Glu He Thr He Asn He Gly Leu Leu Lys He Ser Trp Glu Lys 
545 550 555 

Pro Val Phe Pro Glu Asn Asn Leu Gin Phe Gin He Arg Tyr Gly Leu 
565 570 55 

Ser Gly Lys Glu Val Gin Trp Lys Met Tyr Glu Val Tyr Asp Ala Lys 
580 585 

Ser Lys Ser Val Ser Leu Pro Val Pro Asp Leu Cys Ala Val Tyr Ala 

595 « 00 605 

Val Gin Val Arg Cys Lys Arg Leu Asp Gly Leu Gly Tyr Trp Ser Asn 

610 «15 620 

Trp Ser Asn Pro Ala Tyr Thr Val Val Met Asp He Lys Val Pro Met 
625 630 635 ' 

Arg Gly Pro Glu Phe Trp Arg He He Asn Gly Asp Thr Met Lys Lys 
€45 650 

Glu Lys Asn Val Thr Leu Leu Trp Lys Pro Leu Met Lys Asn Asp Ser 
660 665 

L eu Cys Ser Val Gin Arg Tyr Val He Asn His His Thr Ser Cys Asn 
675 680 685 



Gly Thr Trp Ser Glu Asp Val Gly Asn His Thr Lys Phe Thr Phe Leu 
690 695 ™° 

Trp Thr Glu Gin Ala His Thr Val Thr Val Leu Ala He Asn Ser lie 
705 "»10 715 

Gly Ala Ser Val Ala Asn Phe Asn Leu Thr Phe Ser Trp Pro Met Ser 
725 730 735 

Lys Val Asn He Val Gin Ser Leu Ser Ala Tyr Pro Leu Asn Ser Ser 
740 7 <5 750 

Cys Val He Val Ser Trp He Leu Ser Pro Ser Asp Tyr Lys Leu Met 
755 760 ? 65 

Tyr Phe He He Glu Trp Lys Asn Leu Asn Glu Asp Gly Glu He Lys 

770 77 5 780 

Trp Leu Arg He Ser Ser Ser Val Lys Lys Tyr Tyr He His Asp His 
785 7 9° 

Phe He Pro He Glu Lys Tyr Gin Phe Ser Leu Tyr Pro He Phe Met 
805 810 

Glu Gly Val Gly Lys Pro Lys He He Asn Ser Phe Thr Gin Asp Asp 
820 825 830 
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lie Glu Lys His 
835 



Gin Ser Asp Ala Gly Leu Tyr Val lie Val Pro Val 
840 845 



lie lie Ser Ser 
850 



Ser lie Leu Leu Leu Gly Thr Leu Leu He Ser His 
855 860 



Gin Arg Met Lys 
865 



Lys Leu Phe Trp Glu Asp Val Pro Asn Pro Lys Asn 
870 875 880 



Cys Ser Trp Ala 



Gin Gly Leu Asn Phe Gin Lys Arg. Thr Asp He Leu 
885 890 895 



Ser Leu He Met He Thr Thr Asp Glu Pro Asn Val Pro Thr Ser Gin 
900 905 910 

Gin Ser He Glu Tyr Lys He Phe Thr Phe Arg Arg Gly Ala Asn Leu 
. 915 920 925 

Lys Lys He Gin Leu Asn Phe Glu Leu Thr Tyr Gly Gly Leu Cys Phe 
930 935 940 

Arg Thr Asn Arg Cys Val Asn Leu Gly Ser Lys Cys Arg Phe Glu Ser 
945 950 955 960 

Ser Leu Asp Val Leu 
965 

(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3193 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS ; single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

CCGCCGCCAT CTCTGCCTTC GGTCGAGTTG GACCCCCGGA TCAAGGTGTA CTTCTCTGAA 60 

GTAAGATGAT TTGTCAAAAA TTCTGTGTGG TTTTGTTACA TTGGGAATTT ATTTATGTGA 120 

TAACTGCGTT TAACTTGTCA TATCCAATTA CTCCTTGGAG ATTTAAGTTG TCTTGCATGC 180 

CACCAAATTC AACCTATGAC TACTTCCTTT TGCCTGCTGG ACTCTCAAAG AATACTTCAA 240 

ATTCGAATGG ACATTATGAG ACAGCTGTTG AACCTAAGTT TAATTCAAGT GGTACTCACT 300 
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TTTCTAACTT ATCCAAAACA ACTTTCCACT GTTGCTTTCG GAGTGAGCAA GATAGAAACT 360 

GCTCCTTATG TGCAGACAAC ATTGAAGGAA AGACATTTGT TTCAACAGTA AATTCTTTAG 420 

TTTTTCAACA AATAGATGCA, AACTGGAACA TACAGTGCTG GCTAAAAGGA GACTTAAAAT 480 

TATTCATCTG TTATGTGGAG TCATTATTTA AGAATCTATT CAGGAATTAT AACTATAAGG 540 . 

TCCATCTTTT ATATGTTCTG CCTGAAGTGT TAGAAGATTC ACCTCTGGTT CCCCAAAAAG 600 

GCAGTTTTCA GATGGTTCAC TGCAATTGCA GTGTTCATGA ATGTTGTGAA TGTCTTGTGC 660 

CTGTGCCAAC AGCCAAACTC AACGACACTC TCCTTATGTG TTTGAAAATC ACATCTGGTG 720 

GAGTAATTTT CCAGTCACCT CTAATGTCAG T TCAGCCC AT AAATATGGTG AAGCCTGATC 7 80 
CACCATTAGG TTTGCATATG GAAATCACAG ATGATGGTAA TTTAAAGATT TCTTGGTCCA 840 
GCCCACCATT GGTACCATTT CCACTTCAAT ATCAAGTGAA ATATTCAGAG AATTCTACAA 900 
CAGTTATCAG AGAAGCTGAC AAGATTGTCT CAGCTACATC CCTGCTAGTA G AC AG TAT AC 960 

TTCCTGGGTC TTCGTATGAG GTTCAGGTGA GGGGCAAGAG ACTGGATGGC CCAGGAATCT 1020 

GGAGTGACTG GAGTACTCCT CGTGTCTTTA CCACACAAGA TGTCATATAC TTTCCACCTA 1080 

AAATTCTGAC AAGTGTTGGG TCTAATGTTT CTTTTCACTG CATCTATAAG AAGGAAAACA 1140 

AGATTGTTCC CTCAAAAGAG ATTGTTTGGT GGATGAATTT AGCTGAGAAA ATTCCTCAAA 1200 

GCCAGTATGA TGTTGTGAGT GATCATGTTA GCAAAGTTAC TTTTTTCAAT CTGAATGAAA 1260 

CCAAACCTCG AGGAAAGTTT ACCTATGATG CAGTGTACTG CTGCAATGAA CATGAATGCC 1320 

ATCATCGCTA TGCTGAATTA TATGTGATTG ATGTCAATAT CAATATCTCA TGTGAAACTG 1380 

ATGGGTACTT AACTAAAATG AC T TGC AG AT GGTCAACCAG TACAATCCAG TCACTTGCGG 1440 

AAAGCACTTT GCAATTGAGG TATCATAGGA GCAGCCTTTA CTGTTCTGAT ATTCCATCTA 1500 

TTCATCCCAT ATCTGAGCCC AAAGATTGCT ATTTGCAGAG TGATGGTTTT TATGAATGCA 1560 

TTTTCCAGCC AATCTTCCTA TTATCTGGCT ACACAATGTG GATTAGGATC AATCACTCTC 1620 

TAGGTTCACT TGACTCTCCA CCAACATGTG TCCTTCCTGA TTCTGTGGTG AAGCCACTGC 1680 
CTCCATCCAG TGTGAAAGCA GAAATTACTA TAAACATTGG ATTATTGAAA ATATCTTGGG 17 40 

AAAAGCCAGT CTTTCCAGAG AATAACCTTC AATTCCAGAT TCGCTATGGT TTAAGTGGAA ^ 1800 
AAGAAGTACA ATGGAAGATG TATGAGGTTT ATGATGCAAA ATCAAAATCT GTCAGTCTCC 1860 
CAGTTCCAGA CTTGTGTGCA GTCTATGCTG TTCAGGTGCG CTGTAAGAGG CTAGATGGAC 1920 
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TGGGATATTG GAGTAATTGG AGCAATCCAG CCTACACAGT TGTCATGGAT ATAAAAGTTC 1980 

CTATGAGAGG ACCTGAATTT TGGAGAATAA TTAATGGAGA TACTATGAAA AAGGAGAAAA 2040 

5 

ATGTCACTTT ACTTTGGAAG* CCCCTGATGA AAAATGACTC ATTGTGCAGT GTTCAGAGAT 2100 

ATGTGATAAA CCATCATACT TCCTGCAATG GAACATGGTC AGAAGATGTG GGAAATCACA 2160 

10 CGAAATTCAC TTTCCTGTGG ACAGAGCAAG CACATACTGT TACGGTTCTG GCCATCAATT 2220 



CAATTGGTGC TTCTGTTGCA AATTTTAATT TAACCTTTTC ATGGCCTATG AGCAAAGTAA 2280 



ATATCGTGCA GTCACTCAGT GCTTATCCTT TAAACAGCAG TTGTGTGATT GTTTCCTGGA 2340 

15 

TACTATCACC CAGTGATTAC AAGCTAATGT ATTTTATTAT TGAGTGGAAA AATCTTAATG 2400 



AAGATGGTGA AATAAAATGG CTTAGAATCT CTTCATCTGT TAAGAAGTAT TATATCCATG 2460 
20 ATCATTTTAT CCCCATTGAG AAGTACCAGT TCAGTCTTTA CCCAATATTT ATGGAAGGAG 2520 



TGGGAAAACC AAAGATAATT AATAGTTTCA CTCAAGATGA TATTGAAAAA CACCAGAGTG 2580 



ATGCAGGTTT ATATGTAATT GTGCCAGTAA TTATTTCCTC TTCCATCTTA TTGCTTGGAA 2 640 

25 

CAT T ATT AAT ATCACACCAA AGAATGAAAA AGCTATTTTG GGAAGATGTT CCGAACCCCA 2700 



AGAATTGTTC CTGGGCACAA GGACTTAATT TTCAGAAGAG AACGGACATT CTTTGAAGTC 2760 
30 TAATCATGAT CACTACAGAT GAACCCAATG TGCCAACTTC CCAACAGTCT ATAGAGTATT 2820 



AGAAGATTTT TAC AT TTTGA AGAAGGGGAG CAAATCTAAA AAAAAT TC AG TTGAACTTCT . 2880 

GAG AG T T AAC ATATGGTGGA TTATGTTGAT TTAGAACTTA AAATAGATGT GTAAATTTGG 2940 

35 

GTTCAAAATG TAGATTTGAG TCCAGTTTGG ATGTGTGATT AATTTTCAAA TCATCTAAAG 3000 



TTTAAAAGTA GTATTCATGA TTTCTGGCTT TTGATTTGCC ATATTCCTGG TCATAAAACA 3060 
40 TTAAGAAAAT TATGGCTGTT GCTGTCATTA CATATCTATT AAATGTCATC AAATATGTAG 3120 
TAGACAATTT TGTAATTAGG TGAACTCTAA AACTGCAACA TCTGACAAAT TGCTTTAAAA 3180 



ATACAATGAT TAT 3193 

45 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i> SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 995 amino acids 
50 (B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

Met lie Cys Gin Lys Phe Cys Val Val Leu Leu His Trp Glu Phe lie 
1 5 10 15 

Tyr Val lie Thr Ala Phe Asn Leu Ser Tyr Pro lie Thr Pro Trp Arg 
20 25 30 



Phe Lys Leu Ser Cys Met Pro Pro Asn Ser Thr Tyr Asp Tyr Phe Leu 
15 35 40 45 

Leu Pro Ala Gly Leu Ser Lys Asn Thr Ser Asn Ser Asn Gly His Tyr 
50 55 60 

20 Glu Thr Ala Val Glu Pro Lys Phe Asn Ser Ser Gly Thr His Phe Ser 

65 70 75 80 

v Asn Leu Ser Lys Thr Thr Phe His Cys Cys Phe Arg Ser Glu Gin Asp 
85 90 95 

Arg Asn Cys Ser Leu Cys Ala Asp Asn lie Glu Gly Lys Thr Phe Val 
100 105 HO 



Ser Thr Val Asn Ser Leu Val Phe Gin Gin lie Asp Ala Asn Trp Asn 
30 115 120 125 

lie Gin Cys Trp Leu Lys Gly Asp Leu Lys Leu Phe lie Cys Tyr Val 
130 135 140 

35 Glu Ser Leu Phe Lys Asn Leu Phe Arg Asn Tyr Asn Tyr Lys Val His 

145 150 155 160 



Leu Leu Tyr Val Leu Pro Glu Val Leu Glu Asp Ser Pro Leu Val Pro 
165 170 175 

Gin Lys Gly Ser Phe Gin Met Val His Cys Asn Cys Ser Val His Glu 
180 185 190 



Cys Cys Glu Cys Leu Val Pro Val Pro Thr Ala Lys Leu Asn Asp Thr 
45 195 200 205 

Leu Leu Met Cys Leu Lys lie Thr Ser Gly Gly Val lie Phe Gin Ser 
210 215 220 



50 



Pro Leu Met Ser Val Gin Pro lie Asn Met Val' Lys Pro Asp Pro Pro 
225 230 235 240 
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Leu Gly Leu His Met Glu He Thr Asp Asp Gly Asn Leu Lys He Ser 
245 250 255 

Trp Ser Ser Pro Pro Leu Val Pro Phe Pro Leu Gin Tyr Gin Val Lys 

5 260 265 270 

Tyr Ser Glu Asn Ser Thr Thr Val lie Arg Glu Ala Asp Lys lie Val 

275 280 285 

10 Ser Ala Thr Ser Leu Leu Val Asp Ser He Leu Pro Gly Ser Ser Tyr 
290 295 300 



15 



30 



45 



Glu Val Gin Val Arg Gly Lys Arg Leu Asp Gly Pro Gly He Trp Ser 
305 310 315 320 

Asp Trp Ser Thr Pro Arg Val Phe Thr Thr. Gin Asp Val lie Tyr Phe 
325 330 335 



Pro Pro Lys He Leu Thr Ser Val Gly Ser Asn Val Ser Phe His Cys 
20 340 345 350 

He Tyr Lys Lys Glu Asn Lys He Val Pro Ser Lys Glu He Val Trp 
355 360 365 

25 Trp Met Asn Leu Ala Glu Lys He Pro Gin Ser Gin Tyr Asp Val Val 

370 375 380 

Ser Asp His Val Ser Lys Val Thr Phe Phe Asn Leu Asn Glu Thr Lys 
385 390 395 400 



Pro Arg Gly Lys Phe Thr Tyr Asp Ala Val Tyr Cys Cys Asn Glu His 
405 410 415 



Glu Cys His His Arg Tyr Ala Glu Leu Tyr Val He Asp Val Asn He 
35 420 425 430 

Asn He Ser Cys Glu Thr Asp Gly Tyr Leu Thr Lys Met Thr Cys Arg 
435 - 440 445 

40 Trp Ser Thr Ser Thr He Gin Ser Leu Ala Glu Ser Thr Leu Gin Leu 

450 455 460 

Arg Tyr His Arg Ser Ser Leu Tyr Cys Ser Asp He Pro Ser He His 
465 470 475 480 



Pro He Ser Glu Pro Lys Asp Cys Tyr Leu Gin Ser Asp Gly Phe Tyr 
485 490 495 



Glu Cys He Phe Gin Pro He Phe Leu Leu Ser Gly Tyr Thr Met Trp 
50 500 505 510 



He Arg He Asn His Ser Leu Gly Ser Leu Asp Ser Pro Pro Thr Cys 
515 520 525 
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Val Leu Pro Asp Ser Val Val Lys Pro Leu Pro Pro Ser Ser Val Lys 
530 535 540 

Ala Glu lie Thr lie Asn lie Gly Leu Leu Lys lie Ser Trp Glu Lys 
545 .550 555 560 

Pro Val Phe Pro Glu Asn Asn Leu Gin Phe Gin lie Arg Tyr Gly Leu 
565 570 575 

Ser Gly Lys Glu Val Gin Trp Lys Met Tyr Glu Val Tyr Asp Ala Lys 
580 585 590 

Ser Lys Ser Val Ser Leu Pro Val Pro Asp Leu Cys Ala Val Tyr Ala 
595 600 605 

Val Gin Val Arg Cys Lys Arg Leu Asp Gly Leu Gly Tyr. Trp Ser Asn 
610 615 620 

Trp Ser Asn Pro Ala Tyr Thr Val Val Met Asp lie Lys Val Pro Met 
625 630 635 640 

Arg Gly Pro Glu Phe Trp Arg lie lie Asn Gly Asp Thr Met Lys. Lys 
645 650 655 

Glu Lys Asn Val Thr Leu Leu Trp Lys Pro Leu Met Lys Asn Asp Ser 
660 665 670 

Leu Cys Ser Val Gin Arg Tyr Val lie Asn His His Thr Ser Cys Asn 
675 680 685 

Gly Thr Trp Ser Glu Asp Val Gly Asn His Thr Lys Phe Thr Phe Leu 
690 695 700 

Trp Thr Glu Gin Ala His Thr Val Thr Val Leu Ala He Asn Ser He 
705 710 715 720 

Gly Ala Ser Val Ala Asn Phe Asn Leu Thr- Phe Ser Trp Pro Met Ser 
725 730 735 

Lys Val Asn lie Val Gin Ser Leu Ser Ala Tyr Pro Leu Asn Ser Ser 
740 745 750 

Cys Val lie Val Ser Trp lie Leu Ser Pro Ser Asp Tyr Lys Leu Met 
755 760 765 

Tyr Phe He He Glu Trp Lys Asn Leu Asn Glu Asp Gly . Glu He Lys 
770 775 780 

Trp Leu Arg He Ser Ser Ser Val Lys Lys Tyr Tyr lie His Asp His 
785 790 795 800 
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Phe lie Pro lie Glu Lys Tyr Gin Phe Ser Leu Tyr Pro lie Phe Met 
805 810 815 

Glu Gly Val Gly Lys Pro Lys lie lie Asn Ser Phe Thr Gin Asp Asp 
5 820 825 830 

lie Glu Lys His Gin Ser Asp Ala Gly Leu Tyr Val He Val Pro Val 
835 840 845 

10 lie He Ser Ser Ser He Leu Leu Leu Gly Thr Leu Leu He Ser His 

850 855 860 



15 



30 



Gin Arg Met Lys Lys Leu Phe Trp Glu Asp Val Pro Asn Pro Lys Asn 

865 870 875 880 

Cys Ser Trp Ala Gin Gly Leu Asn Phe Gin Lys Lys Arg Leu Ser He 

885 890 895 



Phe Leu Ser Ser He Gin His Gin His Val Val Leu Phe Phe Trp Ser 
20 900 905 910 

Leu Lys Gin Phe Gin Lys He Ser Val Leu He His His Gly Lys He 

915 920 925 

25 Lys Met Arg Cys Gin Gin Leu Trp Ser Leu Tyr Phe Gin Gin Gin He 

930 935 940 



Leu Lys Arg Val Leu Phe Val Leu Val Thr Ser Ser Thr Val Leu Thr 
945 950 955 960 

Ser Leu Arg Leu Arg Val Leu Arg Pro Met Arg Thr Lys Ala Arg Asp 
965 970 975 



Asn Pro Leu Leu Ash Thr Pro Arg Ser Ala Thr Leu Asn Gin Val Lys 
35 980 985 990 

Leu Val Lys 
995 

40 (2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3063 base pairs 

(B) TYPE: nucleic acid 
45 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



50 
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30 
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40 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
CCGCCGCCAT CTCTGCCTTC GGTCGAGTTG GACCCCCGGA TCAAGGTGTA CTTCTCTGAA 
GTAAGATGAT TTGTCAAAAA r TTCTGTGTGG TTTTGTTACA TTGGGAATTT ATTTATGTGA 
TAACTGCGTT TAACTTGTCA TATCCAATTA CTCCTTGGAG ATTTAAGTTG TCTTGCATGC 
CACCAAATTC AACCTATGAC TACTTCCTTT TGCCTGCTGG ACTCTCAAAG AATACTTCAA 
ATTCGAATGG ACATTATGAG ACAGCTGTTG AACCTAAGTT TAATTCAAGT GGTACTCACT 
TTTCTAACTT ATCCAAAACA ACTTTCCACT GTTGCTTTCG GAGTGAGCAA GATAGAAACT 
GCTCCTTATG TGCAGACAAC ATTGAAGGAA AGACATTTGT TTCAACAGTA AATTCTTTAG 
TTTTTCAACA AATAGATGCA AACTGGAACA TACAGTGCTG GCTAAAAGGA GACTTAAAAT 
TATTCATCTG TTATGTGGAG TCATTATTTA AGAATCTATT CAGGAATTAT AACTATAAGG 
TCCATCTTTT ATATGTTCTG CCTGAAGTGT TAGAAGATTC ACCTCTGGTT CCCCAAAAAG 
GCAGTTTTCA GATGGTTCAC TGCAATTGCA GTGTTCATGA ATGTTGTGAA TGTCTTGTGC 
CTGTGCCAAC AGCCAAACTC AACGACACTC TCCTTATGTG TTTGAAAATC ACATCTGGTG 
GAGTAATTTT CCAGTCACCT CTAATGTCAG TTCAGCCCAT AAATATGGTG AAGCCTGATC 
CACCATTAGG TTTGCATATG GAAATCACAG ATGATGGTAA TTTAAAGATT TCTTGGTCCA 
GCCCACCATT GGTACCATTT CCACTTCAAT ATCAAGTGAA ATATTCAGAG AATTCTACAA 
CAGTTATCAG AGAAGCTGAC AAGATTGTCT CAGCTACATC CCTGCTAGTA GACAGTATAC 
TTCCTGGGTC TTCGTATGAG GTTCAGGTGA GGGGCAAGAG ACTGGATGGC CCAGGAATCT 
GGAGTGACTG GAGTACTCCT CGTGTCTTTA CCACACAAGA TGTCATATAC TTTCCACCTA 
AAATTCTGAC AAGTGTTGGG TCTAATGTTT CTTTTCACTG CATCTATAAG AAGGAAAACA 
AGATTGTTCC CTCAAAAGAG ATTGTTTGGT GGATGAATTT AGCTGAGAAA ATTCCTCAAA 
GCCAGTATGA TGTTGTGAGT GATCATGTTA GCAAAGTTAC TTTTTTCAAT CTGAATGAAA 
CCAAACCTCG AGGAAAGTTT ACCTATGATG CAGTGTACTG CTGCAATGAA CATGAATGCC 
ATCATCGCTA TGCTGAATTA TATGTGATTG ATGTCAATAT CAATATCTCA TGTGAAACTG 
ATGGGTACTT AACTAAAATG ACTTGCAGAT GGTCAACCAG TACAATCCAG TCACTTGCGG 
AAAGCACTTT GCAATTGAGG TATCATAGGA GCAGCCTTTA CTGTTCTGAT ATTCCATCTA 
TTCATCCCAT ATCTGAGCCC AAAGATTGCT ATTTGCAGAG TGATGGTTTT TATGAATGCA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
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1140 
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20 



TTTTCCAGCC AATCTTCCTA TTATCTGGCT ACACAATGTG GATTAGGATC AATCACTCTC 
s TAGGTTCACT TGACTCTCCA CCAACATGTG TCCTTCCTGA TTCTGTGGTG AAGCCACTGC 
CTCCATCCAG TGTGAAAGCA GAAATTACTA TAAACATTGG ATTATTGAAA ATATCTTGGG 
AAAAGCCAGT CTTTCCAGAG AATAACCTTC AATTCCAGAT TCGCTATGGT TTAAGTGGAA 
10 AAGAAGTACA ATGGAAGATG TATGAGGTTT ATGATGCAAA ATCAAAATCT GTCAGTCTCC 
CAGTTCCAGA CTTGTGTGCA GTCTATGCTG TTCAGGTGCG CTGTAAGAGG CTAGATGGAC 
^ TGGGATATTG GAGTAATTGG AGCAATCCAG CCTACACAGT TGTCATGGAT ATAAAAGTTC 
CTATGAGAGG ACCTGAATTT TGGAGAATAA TTAATGGAGA TACTATGAAA AAGGAGAAAA 
ATGTCACTTT ACTTTGGAAG CCCCTGATGA AAAATGACTC ATTGTGCAGT GTTCAGAGAT 
ATGTGATAAA CCATCATACT TCCTGCAATG GAACATGGTC AGAAGATGTG GGAAATCACA 
CGAAATTCAC TTTCCTGTGG ACAGAGCAAG CACATACTGT TACGGTTCTG GCCATCAATT 
^ CAATTGGTGC TTCTGTTGCA AATTTTAATT TAACCTTTTC ATGGCCTATG AGCAAAGTAA 
ATATCGTGCA GTCACTCAGT GCTTATCCTT TAAACAGCAG TTGTGTGATT GTTTCCTGGA 
TACTATCACC CAGTGATTAC AAGCTAATGT ATTTTATTAT TGAGTGGAAA AATCTTAATG 
30 AAGATGGTGA AATAAAATGG CTTAGAATCT CTTCATCTGT TAAGAAGTAT TATATCCATG 
ATCATTTTAT CCCCATTGAG AAGTACCAGT TCAGTCTTTA CCCAATATTT ATGGAAGGAG 
^ TGGGAAAACC AAAGATAATT AATAGTTTCA CTCAAGATGA TATTGAAAAA CACCAGAGTG 
ATGCAGGTTT ATATGTAATT GTGCCAGTAA TTATTTCCTC TTCCATCTTA TTGCTTGGAA 
CATTATTAAT ATCACACCAA AGAATGAAAA AGCTATTTTG GGAAGATGTT CCGAACCCCA 
40 AGAATTGTTC CTGGGCACAA GGACTTAATT TTCAGAAGAA ACGTTTGAGC ATCTTTTTAT 
CAAGCATACA GCATCAGTGA CATGTGGTCC TCTTCTTTTG GAGCCTGAAA CAATTTCAGA 
^ AGATATCAGT GTTGATACAT CATGGAAAAA TAAAGATGAG ATGATGCCAA CAACTGTGGT 
CTCTCTACTT TCAACAACAG ATCTTGAAAA GGGTTCTGTT TGTTTTAGTG ACCAGTTCAA 
CAGTGTTAAC TTCTCTGAGG CTGAGGGTAC TGAGGTAACC TATGAGGACG AAAGCCAGAG 
ACAACCCTTT GTTAAATACG CCACGCTGAT CAGCAACTCT AAACCAAGTG AAACTGGTGA 



AGA 
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(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 969 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Met He Cys Gin Lys Phe Cys Val Val Leu Leu His Trp Glu Phe He 
1 .5 10 . 15 

Tvr Val He Thr Ala Phe Asn Leu Ser Tyr Pro He Thr Pro Trp Arg 
20 25 30 

Phe Lys Leu Ser Cys Met Pro Pro Asn Ser Thr Tyr Asp Tyr Phe Leu 
35 40 45 

Leu Pro Ala Gly Leu Ser Lys Asn Thr Ser Asn Ser Asn Gly His Tyr 
50 55 60 

Glu Thr Ala Val Glu Pro Lys Phe Asn Ser Ser Gly Thr His Phe Ser 
65 70 75 80 

Asn Leu Ser Lys Thr Thr Phe His Cys Cys Phe Arg Ser Glu Gin Asp 
85 90 95 

Arg Asn Cys Ser Leu Cys Ala Asp Asn lie Glu Gly Lys Thr Phe Val 
100 105 HO 

Ser Thr Val Asn Ser Leu Val Phe Gin Gin He Asp Ala Asn Trp Asn 
115 120 125 

He Gin Cys Trp Leu Lys Gly Asp Leu Lys Leu Phe He Cys Tyr Val 
130 135 140 

Glu Ser Leu Phe Lys Asn Leu Phe Arg Asn Tyr Asn Tyr Lys Val His 
145 150 155 160 

Leu Leu Tyr Val Leu Pro Glu Val Leu Glu Asp Ser Pro Leu Val Pro 
165 170 175 

Gin Lys Gly Ser Phe Gin Met Val His Cys Asn Cys Ser Val His Glu 
180 185 190 

Cys Cys Glu Cys Leu Val Pro Val Pro Thr Ala Lys Leu Asn Asp Thr 
195 200 205 
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Leu Leu Met Cys Leu Lys lie Thr Ser Gly Gly Val lie Phe Gin Ser 
210 215 220 

Pro Leu Met Ser Val Gin Pro He Asn Met Val Lys Pro Asp Pro Pro 
225 230 235 240 

Leu Gly Leu His Met Glu He Thr Asp Asp Gly Asn Leu Lys He Ser 
245 250 255 

Trp Ser Ser Pro Pro Leu Val Pro Phe Pro Leu Gin Tyr Gin Val Lvs 
260 265 270 

Tyr Ser Glu Asn Ser Thr Thr Val He Arg Glu Ala Asp Lys He Val 
275 280 285 

SSr LeU LSU Val As P Ser Ile Leu p "> Gly Ser Ser Tyr 
290 295 300 

Glu Val Gin Val Arg Gly Lys Arg Leu Asp Gly Pro Gly Ile Trp Ser 
305 310 315 320 

Asp Trp Ser Thr Pro Arg Val Phe Thr Thr Gin Asp Val Ile Tyr Phe 
325 330 335 

Pro Pro Lys Ile Leu Thr Ser Val Gly Ser Asn Val Ser Phe His Cys 
340 345 3 50 

He Tyr Lys Lys Glu Asn Lys Ile Val Pro Ser Lys Glu He Val Trp 
355 360 365 

Trp Met Asn Leu Ala Glu Lys He Pro Gin Ser Gin Tyr Asp Val Val 
370 375 3 8 o 

Ser Asp His Val Ser Lys Val Thr Phe Phe Asn Leu Asn Glu Thr Lys 
385 too ~„- ■* 

J5,u 395 400 

Pro Arg Gly Lys Phe Thr Tyr Asp Ala Val Tyr Cys Cys Asn Glu His 
405 410 415 

Glu Cys His His Arg Tyr Ala Glu Leu Tyr Val He Asp Val Asn Ile 
420 425 43 0 

Asn He Ser Cys Glu Thr Asp Gly Tyr Leu Thr Lys Met Thr Cys Arg 
435 440 445 

Trp Ser Thr Ser Thr He Gin Ser Leu Ala Glu Ser Thr Leu Gin Leu 
450 455 460 

Arg Tyr His Arg Ser Ser Leu Tyr Cys Ser Asp Ile Pro Ser Ile His 
465 4 ™ 475 480 
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Pro He Ser Glu Pro Lys Asp Cys Tyr Leu Gin Ser Asp Gly Phe Tyr 
485 490 9 

Glu Cys He Phe Gin Pro He Phe Leu Leu Ser Gly Tyr Thr Met Trp 
500 505 

He Arg He Asn His Ser Leu Gly Ser Leu Asp Ser Pro Pro Thr Cys 
515 520 525 

1 Leu Pro Asp Ser Val Val Lys Pro Leu Pro Pro Ser Ser Val Lys 



Va 

530 



535 540 



Ala Glu lie Thr He Asn He Gly Leu Leu Lys He Ser Trp Glu Lys 
545 550 555 

Pro Val Phe Pro Glu Asn Asn Leu Gin Phe Gin He Arg Tyr Gly Leu 
565 570 575 

Ser Gly Lys Glu Val Gin Trp Lys Met Tyr Glu Val Tyr Asp Ala Lys 

580 585 

Ser Lys Ser Val Ser Leu Pro Val Pro Asp Leu Cys Ala Val Tyr Ala 
595 600 605 

Val Gin Val Arg Cys Lys Arg Leu Asp Gly Leu Gly Tyr Trp Ser Asn 
610 615 620 

Trp Ser Asn Pro Ala Tyr Thr Val Val Met Asp He Lys Val Pro Met 
625 630 635 

Arg Gly Pro Glu Phe Trp Arg He He Asn Gly Asp Thr Met Lys Lys 
645 650 655 

Glu Lys Asn Val Thr Leu Leu Trp Lys Pro Leu Met Lys'Mn Asp Ser 



660 



665 



His His Thr Ser Cys Asn 



Leu Cys Ser Val Gin Arg Tyr Val He Asn 

675 680 685 

Gly Thr Trp Ser Glu Asp Val Gly Asn His Thr Lys Phe Thr Phe Leu 
•* 700 



690 



695 



Trp Thr Glu Gin Ala His Thr Val Thr Val Leu Ala He Asn Ser He 
705 710 715 

Gly Ala Ser Val Ala Asn Phe Asn Leu Thr Phe Ser Trp Pro Met Ser 



725 



730 



Lys Val Asn He Val Gin Ser Leu Ser Ala Tyr Pro Leu Asn Ser Ser 
740 745 "0 

Cys Val lie Val Ser Trp lie Leu Ser Pro Ser Asp Tyr Lys Leu Met 
755 760 7 65 
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10 



25 



40 



Tyr Phe lie lie Glu Trp Lys Asn Leu Asn Glu Asp Gly Glu lie Lys 
770 775 780 

Trp Leu Arg lie Ser Ser Ser Val Lys Lys Tyr Tyr lie His Asp His 
785 790 795 800 

Phe lie Pro lie Glu Lys Tyr Gin Phe Ser Leu Tyr Pro lie Phe Met 
805 810 815 

Glu Gly Val Gly Lys Pro Lys lie lie Asn Ser Phe Thr Gin Asp Asp 
820 825 830 



lie Glu Lys His Gin Ser Asp Ala Gly Leu Tyr Val lie Val Pro Val 
15 835 840 845 

lie lie Ser Ser Ser lie Leu Leu Leu Gly Thr Leu Leu lie Ser His 
850 855 860 

20 Gin Arg Met Lys Lys Leu Phe Trp Glu Asp Val Pro Asn Pro Lys Asn 

865 870 875 880 

Cys Ser Trp Ala Gin Gly Leu Asn Phe Gin Lys Met Leu Glu Gly Ser 
885 890 895 



Met Phe Val Lys Ser His His His Ser Leu lie Ser Ser Thr Gin Gly 
900 905 910 



His Lys His Cys Gly Arg Pro Gin Gly Pro Leu His Arg Lys Thr Arg 
30 915 920 925 

Asp Leu Cys Ser Leu Val Tyr Leu Leu Thr Leu Pro Pro Leu Leu Ser 
930 935 940 

35 Tyr Asp Pro Ala Lys Ser Pro Ser Val Arg Asn Thr Gin Glu Ser He 

945 950 955 960 



Lys Lys Lys Lys Lys Lys Leu Glu Gly 
965 

(2) INFORMATION FOR SEQ ID NO: 6: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 969 amino acids 
45 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

50 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Met lie Cys Gin Lys Phe Cys Val Val Leu Leu His Trp Glu Phe lie 
1 5 10 15 

Tyr Val He Thr Ala Phe Asn Leu Ser Tyr Pro He Thr Pro Trp Arg 
20 25 30 

Phe Lys Leu Ser Cys Met Pro Pro Asn Ser Thr Tyr Asp Tyr Phe Leu 
35 40 45 

Leu Pro Ala Gly Leu Ser Lys Asn Thr Ser Asn Ser Asn Gly His Tyr 
50 55 60 

Glu Thr Ala Val Glu Pro Lys Phe Asn Ser Ser Gly Thr His Phe Ser 
65 70 75 80 

Asn Leu Ser Lys Thr Thr Phe His Cys Cys Phe Arg Ser Glu Gin Asp 
85 90 95 

Arg Asn Cys Ser Leu Cys Ala Asp Asn lie Glu Gly Lys Thr Phe Val 
100 105 HO 

Ser Thr Val Asn Ser Leu Val Phe Gin Gin He Asp Ala Asn Trp Asn 
115 120 125 

He Gin Cys Trp Leu Lys Gly Asp Leu Lys Leu Phe He Cys Tyr Val 
130 135 140 

Glu Ser Leu Phe Lys Asn Leu Phe Arg Asn Tyr Asn Tyr Lys Val His 
145 150 155 160 

Leu Leu Tyr Val Leu Pro* Glu Val Leu Glu Asp Ser Pro Leu Val Pro 
165 170 175 

Gin Lys Gly Ser Phe Gin Met Val His Cys Asn Cys Ser Val His Glu 
180 185 190 

Cys Cys Glu Cys Leu Val Pro Val Pro Thr Ala Lys Leu Asn Asp Thr 
195 200 205 

Leu Leu Met Cys Leu Lys He Thr Ser Gly Gly Val He Phe Gin Ser 
210 215 220 

Pro Leu Met Ser Val Gin Pro He Asn Met Val Lys Pro Asp Pro Pro 
225 230 235 240 

Leu Gly Leu His Met Glu He Thr Asp Asp Gly Asn Leu Lys He Ser 
245 250 255 

Trp Ser Ser Pro Pro Leu Val Pro Phe Pro Leu Gin Tyr Gin Val Lys 
260 265 270 
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Tyr Ser Glu Asn Ser Thr Thr Val lie Arg Glu Ala Asp Lys lie Val 
275 280 285 

Ser Ala Thr Ser Leu Leu Val Asp Ser lie Leu Pro Gly Ser Ser Tyr 
5 290 295 300 

Glu Val Gin Val Arg Gly Lys Arg Leu Asp Gly Pro Gly lie Trp Ser 
305 310 315 320 

10 Asp Trp Ser Thr Pro Arg Val Phe Thr Thr Gin Asp Val lie Tyr Phe 

325 330 335 



15 



30 



45 



Pro Pro Lys lie Leu Thr Ser Val Gly Ser Asn Val Ser Phe His Cys 

340 345 350 

lie Tyr Lys Lys Glu Asn Lys lie Val Pro Ser Lys Glu lie Val Trp 

355 360 365 



Trp Met Asn Leu Ala Glu Lys lie Pro Gin Ser Gin Tyr Asp Val Val 
20 370 375 380 

Ser Asp His Val Ser Lys Val Thr Phe Phe Asn Leu Asn Glu Thr Lys 
385 390 395 400 

25 Pro Arg Gly Lys Phe Thr Tyr Asp Ala Val Tyr Cys Cys Asn Glu His 

405 410 415 

Glu Cys His His Arg Tyr Ala Glu Leu Tyr Val lie Asp Val Asn lie 
420 425 430 



Asn lie Ser Cys Glu Thr Asp Gly Tyr Leu Thr Lys Met Thr Cys Arg 
435 440 445 



Trp Ser Thr Ser Thr lie Gin Ser Leu Ala Glu Ser Thr Leu Gin Leu 
35 450 455 460 

Arg Tyr His Arg Ser Ser Leu Tyr Cys Ser Asp lie Pro Ser lie His 
465 470 475 480 

40 Pro lie Ser Glu Pro Lys Asp Cys Tyr Leu Gin Ser Asp Gly Phe Tyr 

485 490 495 



Glu Cys lie Phe Gin Pro lie Phe Leu Leu Ser Gly Tyr Thr Met Trp 
500 505 510 

lie Arg lie Asn His Ser Leu Gly Ser Leu Asp Ser Pro Pro Thr Cys 
515 520 525 



Val Leu Pro Asp Ser Val Val Lys Pro Leu Pro Pro Ser Ser Val Lys 

50 530 535 540 

Ala Glu lie Thr lie Asn lie Gly Leu Leu Lys lie Ser Trp Glu Lys 

545 550 555 560 
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Pro Val Phe Pro Glu 
565 

Ser Gly Lys Glu Val 
580 

Ser Lys Ser Val Ser 
595 

Val Gin Val Arg Cys 
610 

Trp Ser Asn Pro Ala 
625 

Arg Gly Pro Glu Phe 
645 

Glu Lys Asn Val Thr 
660 

Leu Cys Ser Val Gin 
675 

Gly Thr Trp Ser Glu 
690 

Trp Thr Glu Gin Ala 
705 

Gly Ala Ser Val Ala 
725 

Lys Val Asn lie Val 
740 

Cys Val lie Val Ser 
755 

Tyr Phe lie lie Glu 
770 

Trp Leu Arg lie Ser 
785 

Phe lie Pro lie Glu 
805 

Glu Gly Val Gly Lys 
820 
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Asn Asn Leu Gin Phe Gin 
570 



Gin Trp Lys Met Tyr Glu 
585 

Leu Pro Val Pro Asp Leu 
600 

Lys Arg Leu Asp Gly Leu 
615 

Tyr Thr Val Val Met Asp 
630 635 

Trp Arg lie He Asn Gly 
650 

Leu Leu Trp Lys Pro Leu 
665 

Arg Tyr Val He Asn His 
680 

Asp Val Gly Asn His Thr 
695 

His Thr Val Thr Val Leu 
710 715 

Asn Phe Asn Leu Thr Phe 
730 

Gin Ser Leu Ser Ala Tyr 
745 

Trp He Leu Ser Pro Ser 
760 

Trp Lys Asn Leu Asn Glu 
775 

Ser Ser Val Lys Lys Tyr 
790 795 

Lys Tyr Gin Phe Ser Leu 
810 

Pro Lys He He Asn Ser 
825 



He Arg Tyr Gly Leu 
575 

Val Tyr Asp Ala Lys 
590 

Cys Ala Val Tyr Ala 
605 

Gly Tyr Trp Ser Asn 
620 

He Lys Val Pro Met 
640 



Asp Thr Met Lys Lys 
655 

Met Lys Asn Asp Ser 
670 

His Thr Ser Cys Asn 
685 

Lys Phe Thr Phe Leu 
700 

Ala He Asn Ser He 
720 

Ser Trp Pro Met Ser 
735 

Pro Leu Asn Ser Ser 
750 

Asp Tyr Lys Leu Met 
765 

Asp Gly Glu lie Lys 
780 

Tyr He His Asp His 
800 

Tyr Pro He Phe Met 
815 

Phe Thr Gin Asp Asp 
830 
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lie Glu Lys His Gin Ser Asp Ala Gly Leu Tyr Val lie Val Pro Val 
835 840 845 

lie lie Ser Ser Ser lie Leu Leu Leu Gly Thr Leu Leu lie Ser His 
5 850 855 860 

Gin Arg Met Lys Lys Leu Phe Trp Glu Asp Val Pro Asn Pro Lys Asn 
865 870 875 880 

10 Cys Ser Trp Ala Gin Gly Leu Asn Phe Gin Lys Met Leu Glu Gly Ser 

885 890 895 



15 



35 



40 



45 



Met Phe Val Lys Ser His His His Ser Leu lie Ser Ser Thr Gin Gly 

900 905 910 

His Lys His Cys Gly Arg Pro Gin Gly Pro Leu His Arg Lys Thr Arg 
915 920 925 



Asp Leu Cys Ser Leu Val Tyr Leu Leu Thr Leu Pro Pro Leu Leu Ser 
20 930 935 940 

Tyr Asp Pro Ala Lys Ser Pro Ser Val Arg Asn Thr Gin Glu Ser lie 
945 950 955 960 

25 Lys Lys Lys Lys Lys Lys Leu Glu Gly 

965 

(2) INFORMATION FOR SEQ ID NO : 7 : 

30 (i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH : 1216 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

Met lie Cys Gin Lys Phe Cys Val Val Leu Leu His Trp Glu Phe lie 
15 10 15 

Tyr Val lie Thr Ala Phe Asn Leu Ser Tyr Pro lie Thr Pro Trp Arg 
20 ... 25 30 



Phe Lys Leu Ser Cys Met Pro Pro Asn Ser Thr Tyr Asp Tyr Phe Leu 

50 35 40 45 

/ 

Leu Pro Ala Gly Leu Ser Lys Asn Thr Ser Asn Ser Asn Gly His Tyr 
50 55 60 
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25 



40 
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Glu Thr Ala Val Glu Pro Lys Phe Asn Ser Ser Gly Thr His Phe Ser 
65 70 75 80 

Asn Leu Ser Lys . Thr Thr Phe His Cys Cys Phe Arg Ser Glu Gin Asp 
s ' 85 90 95 

Arg Asn Cys Ser Leu Cys Ala Asp Asn He Glu Gly Lys Thr Phe Val 
100 105 HO 

Ser Thr Val Asn Ser Leu Val Phe Gin Gin He Asp Ala Asn Trp Asn 
115 120 125 



He Gin Cys Trp Leu Lys Gly Asp Leu Lys Leu Phe He Cys Tyr Val 
15 130 135 140 

Glu Ser Leu Phe Lys Asn Leu Phe Arg Asn Tyr Asn Tyr Lys Val His 
145 150 155 160 

20 Leu Leu Tyr Val Leu Pro Glu Val Leu Glu Asp Ser Pro Leu Val Pro 

165 170 175 



Gin Lys Gly Ser Phe Gin Met Val His Cys Asn Cys Ser Val His Glu 
180 185 190 

Cys Cys Glu Cys Leu Val Pro Val Pro Thr Ala Lys Leu Asn Asp Thr 
195 200 205 

Leu Leu Met Cys Leu Lys He Thr Ser Gly Gly Val He Phe Gin Ser 
30 210 215 220 

Pro Leu Met Ser Val Gin Pro He Asn Met Val Lys Pro Asp Pro Pro 
225 230 235 240 

35 Leu Gly Leu His Met Glu lie Thr Asp Asp Gly Asn Leu Lys He Ser 

245 250 255 

Trp Ser Ser Pro Pro Leu Val Pro Phe Pro Leu Gin Tyr Gin Val Lys 
260 265 270 

Tyr Ser Glu Asn Ser Thr Thr Val He Arg Glu Ala Asp Lys He Val 
275 280 285 



Ser Ala Thr Ser Leu Leu Val Asp Ser He Leu Pro Gly Ser Ser Tyr 
45 290 295 300 

Glu Val Gin Val Arg Gly Lys Arg Leu Asp Gly Pro Gly He Trp Ser 
305 310 315 320 

50 Asp Trp Ser Thr Pro Arg Val Phe Thr Thr Gin Asp Val lie Tyr Phe 

325 330 335 
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Pro Pro Lys He Leu Thr Ser Val Gly Ser Asn Val Ser Phe His Cys 
340 345 350 

lie Tyr Lys Lys Glu Asn Lys He Val Pro Ser Lys Glu lie Val Tro 
355 p 360 365 

Trp Met Asn Leu Ala Glu Lys He Pro Gin Ser Gin Tyr Asp Val Val 
37 ° 375 380 

Ser Asp His Val Ser Lys Val Thr Phe Phe Asn Leu Asn Glu Thr Lvs 
385 3 *0 395 400 

Pro Arg Gly Lys Phe Thr Tyr Asp Ala Val Tyr Cys Cys Asn Glu His 
405 410 415 

Glu Cys His His Arg Tyr Ala Glu Leu Tyr Val He Asp Val Asn He 
420 425 430 

Asn He Ser Cys Glu Thr Asp Gly Tyr Leu Thr Lys Met Thr Cys Arg 
435 440 445 

Trp Ser Thr Ser Thr lie Gin Ser Leu Ala Glu Ser Thr Leu Gin Leu 
450 455 460 

Arg Tyr His Arg Ser Ser Leu Tyr Cys Ser Asp He Pro Ser He His 
465 470 475 480 

Pro He Ser Glu Pro Lys Asp Cys Tyr Leu Gin Ser Asp Gly Phe Tyr 
485 490 495 

Glu Cys He Phe Gin Pro He Phe Leu Leu Ser Gly Tyr Thr Met Trp 
500 505 510 

He Arg He Asn His Ser Leu Gly Ser Leu Asp Ser Pro Pro Thr Cys 
515 520 525 

Val Leu Pro Asp Ser Val Val Lys Pro Leu Pro Pro Ser Ser Val Lys 
530 535 540 

Ala Glu He Thr He Asn He Gly Leu Leu Lys He Ser Trp Glu Lys 
545 550 555 560 

Pro Val Phe Pro Glu Asn Asn Leu Gin Phe Gin He Arg Tyr Gly Leu 
565 570 575 

Ser Gly Lys Glu Val Gin Trp Lys Met Tyr Glu Val Tyr Asp Ala Lys 
580 585 590 

Ser Lys Ser Val Ser Leu Pro Val Pro Asp Leu Cys Ala Val Tyr Ala 
595 600 605 

Val Gin Val Arg Cys Lys Arg Leu Asp Gly Leu Gly Tyr Trp Ser Asn 
610 615 620 
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Trp Ser Asn Pro Ala Tyr Thr Val Val Met Asp lie Lys Val Pro Met 
625 630 635 640 

Arg Gly Pro Glu. Phe Trp Arg He He Asn Gly Asp Thr Met Lys Lys 
645 650 655 

Glu Lys Asn Val Thr Leu Leu Trp Lys Pro Leu Met Lys Asn Asp Ser 
660 665 670 

Leu Cys Ser Val Gin Arg Tyr Val He Asn His His Thr Ser Cys Asn 
675 680 685 

Gly Thr Trp Ser Glu Asp Val Gly Asn His Thr Lys Phe Thr Phe Leu 
690 695 700 

Trp Thr Glu Gin Ala His Thr Val Thr Val Leu Ala He Asn Ser He 
705 710 715 720 

Gly Ala Ser Val Ala Asn Phe Asn Leu Thr Phe Ser Trp Pro Met Ser 
725 730 735 

Lys Val Asn He Val Gin Ser Leu Ser Ala Tyr Pro Leu Asn Ser Ser 
740 745 750 

Cys Val He Val Ser Trp He Leu Ser Pro Ser Asp Tyr Lys Leu Met 
755 760 765 

Tyr Phe He He Glu Trp Lys Asn Leu Asn Glu Asp Gly Glu He Lys 
770 775 780 

Trp Leu Arg He Ser Ser Ser Val Lys Lys Tyr Tyr He His Asp His 
785 790 795 800 

Phe He Pro He Glu Lys Tyr Gin Phe Ser Leu Tyr Pro He Phe Met 
805 810 815 

Glu Gly Val Gly Lys Pro Lys lie He Asn Ser Phe Thr Gin Asp Asp 
820 825 830 

He Glu Lys His Gin Ser Asp Ala Gly Leu Tyr Val He Val Pro Val 
835 840 845 

He He Ser Ser Ser He Leu Leu Leu Gly Thr Leu Leu He Ser His 
850 855 860 

Gin Arg Met Lys Lys Leu Phe Trp Glu Asp Val Pro Asn Pro Lys Asn 
865 870 875 880 

Cys Ser Trp Ala Gin Gly Leu Asn Phe Gin Lys Pro Glu Thr Phe Glu 
885 890 895 
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His Leu Phe lie Lys His Thr Ala Ser Val Thr Cys Gly Pro Leu Leu 
900 905 910 

Leu Glu Pro Glu Thr lie Ser Glu Asp lie Ser Val Asp Thr Ser Trp 
915 920 925 

Lys Asn Lys Asp Glu Met Met Pro Thr Thr Val Val Ser Leu Leu Ser 
930 935 940 



Thr Thr Asp Leu Glu Lys Gly Ser Val Cys lie Ser Asp Gin Phe Asn 
945 950 955 960 

Ser Val Asn Phe Ser Glu Ala Glu Gly Thr Glu Val Thr Tyr Glu Asp 
965 970 975 

Glu Ser Gin Arg Gin Pro Phe Val Lys Tyr Ala Thr Leu lie Ser Asn 
980 985 990 

Ser Lys Pro Ser Glu Thr Gly Glu Glu Gin Gly Leu lie Asn Ser Ser 
995 1000 1005 

Val Thr Lys Cys Phe Ser Ser Lys Asn Ser Pro Leu Lys Asp Ser Phe 
1010 1015 1020 

Ser Asn Ser Ser Trp Glu lie Glu Ala Gin Ala Phe Phe lie Leu Ser 
1025 1030 1035 1040 

Asp Gin His Pro Asn lie lie Ser Pro His Leu Thr Phe Ser Glu Gly 
1045 1050 1055 



Leu Asp Glu Leu Leu Lys Leu Glu 
1060 

Asp Lys Lys Ser lie Tyr Tyr Leu 
1075 108< 

Glu Ser Gly Val Leu Leu Thr Asp 
1090 1095 

Pro Ala Pro Cys Leu Phe Thr Asp 
1105 1110 

Ser His Phe Val Glu Asn Asn lie 
1125 

Thr Phe Ala Ser Tyr Met Pro Gin 
1140 

His Lys lie Met Glu Asn Lys Met 
1155 116 



Gly Asn Phe Pro Glu Glu Asn Asn 
1065 1070 

Gly Val Thr Ser lie Lys Lys Arg 
I 1085 

Lys Ser Arg Val Ser Cys Pro Phe 
1100 

lie Arg Val Leu Gin Asp Ser Cys 
1115 1120 

Asn Leu Gly Thr Ser Ser Lys Lys 
1130 1135 

Phe Gin Thr Cys Ser Thr Gin Thr 
1145 1150 

Cys Asp Leu Thr Val Phe His Arg 
) 1165 



Asn Leu Gin lie Cys Val He Met Gly Asn He Lys Cys Asn Arg Leu 
1170 1175 1180 
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25 



- 118 - 



Leu Trp Val Gly Glu Arg Lys Glu Thr Arg Val Lys Phe Glu Asn Asn 
1185 1190 1195 1200 

Cys Ser Lys Lys, Lys Lys Lys Lys Asn Ser Arg Pro Ala Arg Pro Asp 
1205 1210 1215 



<2) INFORMATION FOR SEQ ID NO: 8: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3599 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 

GCGGCCGCCA GTGTGATGGA TATCTGCAGA ATTCGGCTTT CTCTGCCTTC GGTCGAGTTG 60 

GACCCCCGGA TCAAGGTGTA CTTCTCTGAA GTAAGATGAT TTGTCAAAAA TTCTGTGTGG 120 

TTTTGTTACA TTGGGAATTT ATTTATGTGA TAACTGCGTT TAACTTGTCA TATCCAATTA 180 

30 CTCCTTGGAG ATTTAAGTTG TCTTGCATGC CACCAAATTC AACCTATGAC TACTTCCTTT 240 

TGCCTGCTGG GCTCTCAAAG AATACTTCAA ATTCGAATGG ACAT TATGAG ACAGCTGTTG 300 

AACCTAAGTT TAATTCAAGT GGTACTCACT TTTCTAACTT ATCCAAAACA ACTTTCCACT 360 

35 

GTTGCTTTCG GAGTGAGCAA GATAGAAACT GCTCCTTATG TGCAGACAAC ATTGAAGGAA 420 

AGACATTTGT TTCAACAGTA AATTCTTTAG TTTTTCAACA AATAGATGCA AACTGGAACA 480 

40 TACAGTGCTG GCTAAAAGGA GACTTAAAAT TATTCATCTG TTATGTGGAG TCATTATTTA 540 

AGAATCTATT CAGGAATTAT AACTATAAGG TCCATCTTTT ATATGTTCTG CCTGAAGTGT 600 

TAGAAGATTC ACCTCTGGTT CCCCAAAAAG GCAGTTTTCA GATGGTTCAC TGCAATTGCA 660 

45 

GTGTTCACGA ATGTTGTGAA TGTCTTGTGC CTGTGCCAAC AGCCAAACTC AACGACACTC 720 

TCCTTATGTG TTTGAAAATC ACATCTGGTG GAGTAATTTT CCAGTCACCT CTAATGTCAG 780 

50 TTCAGCCCAT AAATATGGTG AAGCCTGATC CACCATTAGG TTTGCATATG GAAATCACAG 8 40 

ATGATGGTAA TTTAAAGATT TCTTGGTCCA GCCCACCATT GGTACCATTT CCACTTCAAT 900 
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ATCAAGTGAA ATATTCAGAG AATTCTACAA 
CAGCTACATC CCTGCTAGTA GACAGTATAC 
5 GGGGCAAGAG ACTGGATGGC CCAGGAATCT 
CCACACAAGA TGTCATATAC TTTCCACCTA 
CTTTTCACTG CATCTATAAG AAGGAAAACA 

10 

GGATGAATTT AGCTGAGAAA ATTCCTCAAA 
GCAAAGTTAC TTTTTTCAAT CTGAATGAAA 
15 CAGTGTACTG CTGCAATGAA CATGAATGCC 
ATGTCAATAT CAATATCTCA TGTGAAACTG 
GGTCAACCAG TACAATCCAG TCACTTGCGG 

20 

GCAGCCTTTA CTGTTCTGAT ATTCCATCTA 
ATT TGCAG AG TGATGGTTTT TATGAATGCA 
25 ACACAATGTG GATT AGGATC AATCACTCTC 
TCCTTCCTGA TTCTGTGGTG AAGCCACTGC 
TAAACATTGG ATTATTGAAA AT ATCTTGGG 

30 

AATTCCAGAT TCGCTATGGT TTAAGTGGAA 
ATGATGCAAA ATCAAAATCT GTCAGTCTCC 
35 TTCAGGTGCG CTGTAAGAGG CTAGATGGAC 
CCTACACAGT TGTCATGGAT ATAAAAGTTC 
TTAATGGAGA TACTATGAAA AAGGAGAAAA 

40 

AAAATGACTC ATTGTGCAGT GTTCAGAGAT 
GAACATGGTC AGAAGATGTG GGAAATCACA 
45 CACATACTGT TACGGTTCTG GCCATCAATT 
TAACCTTTTC ATGGCCTATG AGCAAAGTAA 
TAAACAGCAG TTGTGTGATT GTTTCCTGGA 

50 

ATTTTATTAT TGAGTGGAAA AATCTTAATG 
CTTCATCTGT TAAGAAGTAT TATATCCATG 



119 - 

CAGTTATCAG AGAAGCTGAC AAGATTGTCT 960 

TTCCTGGGTC TTCGTATGAG GTTCAGGTGA 1020 

GGAGTGACTG GAGTACTCCT CGTGTCTTTA 1080 

AAATTCTGAC AAGTGTTGGG TCTAATGTTT 1140 

AGATTGTTCC CTCAAAAGAG ATTGTTTGGT 1200 

GCCAGTATGA TGTTGTGAGT GATCATGTTA 1260 

CCAAACCTCG AGGAAAGTTT ACCTATGATG 1320 

ATCATCGCTA TGCTGAAT T A TATGTGATTG 138 0 

ATGGGTACTT AACTAAAATG ACTTGCAGAT 1440 

AAAGCACTTT GCAATTGAGG TATCATAGGA 1500 

TTCATCCCAT ATCTGAGCCC AAAGATTGCT 1560 

TTTTCCAGCC AATCTTCCTA TTATCTGGCT 1620 

TAGGTTCACT TGACTCTCCA CCAACATGTG 1680 

CTCCATCCAG TGTGAAAGCA GAAATTACTA 174 0 

AAAAGCCAGT CTTTCCAGAG AATAACCTTC 18 00 

AAGAAGTACA ATGGAAGATG TATGAGGTTT 1860 

CAGTTCCAGA CTTGTGTGCA GTCTATGCTG 1920 

TGGGATATTG GAGTAATTGG AGCAATCCAG 1980 

CTATGAGAGG ACCTGAATTT TGGAGAATAA 2040 

ATGTCACTTT ACTTTGGAAG CCCCTGATGA 2100 

ATGTGATAAA CCATCATACT TCCTGCAATG 2160 

CGAAATTCAC TTTCCTGTGG ACAGAGCAAG 2220 

CAATTGGTGC TTCTGTTGCA AATTTTAATT 2280 

ATATCGTGCA GTCACTCAGT GCTTATCCTT 2340 

TACTATCACC CAGTGATTAC AAGCTAATGT 2400 

AAGATGGTGA AATAAAATGG CTTAGAATCT 2460 

ATCATTTTAT CCCCATTGAG AAGTACCAGT 2520 
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TCAGTCTTTA CCCAATATTT ATGGAAGGAG TGGGAAAACC AAAGATAATT AATAGTTTCA 2580 

CTCAAGATGA TATTGAAAAA CACCAGAGTG ATGCAGGTTT ATATGTAATT GTGCCAGTAA 2640 

5 

TTATTTCCTC TTCCATCTTA TTGCTTGGAA CATTATTAAT ATCACACCAA AGAATGAAAA 2700 

AGCTATTTTG GG AAGATGTT CCGAACCCCA AGAATTGTTC CTGGGCACAA GGACTTAATT 27 60 

10 TTCAGAAGCC AGAAACGTTT GAGCATCTTT TTATCAAGCA TACAGCATCA GTGACATGTG 2820 

GTCCTCTTCT TTTGGAGCCT GAAACAATTT CAGAAGATAT CAGTGTTGAT ACATCATGGA 2880 

AAAATAAAGA TGAGATGATG CCAACAACTG TGGTCTCTCT ACTTTCAACA ACAGATCTTG 2940 

15 

AAAAGGGTTC TGTTTGTATT AGTGACCAGT TCAACAGTGT TAACTTCTCT GAGGCTGAGG 3000 

GTACTGAGGT AACCTATGAG GACGAAAGCC AGAGACAACC CTTTGTTAAA TACGCCACGC 30 60 

20 TGATCAGCAA CTCTAAACCA AGTGAAACTG GTGAAGAACA AGGGCTTATA AATAGTTCAG 3120 

TCACCAAGTG CTTCTCTAGC AAAAATTCTC CGTTGAAGGA TTCTTTCTCT AATAGCTCAT 3180 

GGGAGATAGA GGCCCAGGCA TTTTTTATAT TATCGGATCA GCATCCCAAC ATAATTTCAC 3240 

25 

CACACCTCAC ATTCTCAGAA GGATTGGATG AACTTTTGAA ATTGGAGGGA AATTTCCCTG 3300 

AAGAAAATAA TGATAAAAAG TCTATCTATT ATTTAGGGGT CACCTCAATC AAAAAGAGAG 33 60 

30 AGAGTGGTGT GCTTTTGACT GACAAGTCAA GGGTATCGTG CCCATTCCCA GCCCCCTGTT 3420 

TATTCACGGA CATCAGAGTT CTCCAGGACA GTTGCTCACA CTTTGTAGAA AATAATATCA 3480 

ACTTAGGAAC TTCTAGTAAG AAGACTTTTG CATCTTACAT GCCTCAATTC CAAACTTGTT 354 0 

35 

CTACTCAGAC TCATAAGATC ATGGAAAACA AGATGTGTGA CCTAACTGTG TAATCTAGA 3599 

(2) INFORMATION FOR SEQ ID NO : 9 : 

40 (i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 



45 



50 



<ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
NNNNNTACCT TTTCCAG 17 
5 (2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 839 amino acids 

(B) TYPE: amino acid 

10 (C) ST RAND ED NESS : single 

(D) TOPOLOGY: linear 



15 



25 



40 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10; 



20 Met He Cys Gin Lys Phe Cys Val Val Leu Leu His Trp Glu Phe lie 

1 5 10 15 



Tyr Val lie Thr Ala Phe Asn Leu Ser Tyr Pro He Thr Pro Trp Arg 
20 25 30 

Phe Lys Leu Ser Cys Met Pro Pro Asn Ser Thr Tyr Asp Tyr Phe Leu 
35 40 45 



Leu Pro Ala Gly Leu Ser Lys Asn Thr Ser Asn Ser Asn Gly His Tyr 
30 50 55 60 

Glu Thr Ala Val Glu Pro Lys Phe Asn Ser Ser Gly Thr His Phe Ser 
65 70 75 80 

35 Asn Leu Ser Lys Thr Thr Phe His Cys Cys Phe Arg Ser Glu Gin Asp 

85 90 95 



Arg Asn Cys Ser Leu Cys Ala Asp Asn He Glu Gly Lys Thr Phe Val 
100 105 110 

Ser Thr Val Asn Ser Leu Val Phe Gin Gin He Asp Ala Asn Trp Asn 
115 120 125 



He Gin Cys Trp Leu Lys Gly Asp Leu Lys Leu Phe He Cys Tyr Val 
45 130 135 140 

Glu Ser Leu Phe Lys Asn Leu Phe Arg Asn Tyr Asn Tyr Lys Val His 
145 150 155 160 



50 



Leu Leu Tyr Val Leu Pro Glu Val Leu Glu Asp Ser Pro Leu Val Pro 
165 170 175 
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Gin Lys Gly Ser Phe Gin Met Val His Cys Asn Cys Ser Val His Glu 
180 185 190 

Cys Cys Glu Cys Leu Val Pro Val Pro Thr Ala Lys Leu Asn Asp Thr 
5 195 200 205 

Leu Leu Met Cys Leu Lys lie Thr Ser Gly Gly Val lie Phe Gin Ser 
210 215 220 

!0 Pro Leu Met Ser Val Gin Pro lie Asn Met Val Lys Pro Asp Pro Pro 

225 230 235 240 



15 



30 



45 



Leu Gly Leu His Met Glu lie Thr Asp Asp Gly Asn Leu Lys He Ser 
245 250 255 

Trp Ser Ser Pro Pro Leu Val Pro Phe Pro Leu Gin Tyr Gin Val Lys 
260 265 270 



Tyr Ser Glu Asn Ser Thr Thr Val He Arg Glu Ala Asp Lys He Val 
20 275 280 285 

Ser Ala Thr Ser Leu Leu Val Asp Ser He Leu Pro Gly Ser Ser Tyr 
290 295 300 

25 Glu Val Gin Val Arg Gly Lys Arg Leu Asp Gly Pro Gly He Trp Ser 

305 310 315 320 



Asp Trp Ser Thr Pro Arg Val Phe Thr Thr Gin Asp Val He Tyr Phe 

325 330 335 

Pro Pro Lys He Leu Thr Ser Val Gly Ser Asn Val Ser Phe His Cys 

340 345 350 



He Tyr Lys Lys Glu Asn Lys He Val Pro Ser Lys Glu He Val Trp 

35 355 360 365 

Trp Met Asn Leu Ala Glu Lys He Pro Gin Ser Gin Tyr Asp Val Val 

370 375 380 

40 Ser Asp His Val Ser Lys Val Thr Phe Phe Asn Leu Asn Glu Thr Lys 

385 390 395 400 



Pro Arg Gly Lys Phe Thr Tyr Asp Ala Val Tyr Cys Cys Asn Glu His 

405 410 415 

Glu Cys His His Arg Tyr Ala Glu Leu Tyr Val He Asp Val Asn He 
420 425 430 



Asn He Ser Cys Glu Thr Asp Gly Tyr Leu Thr Lys Met Thr Cys Arg 
50 435 440 445 

Trp Ser Thr Ser Thr He Gin Ser Leu Ala Glu Ser Thr Leu Gin Leu 
450 455 460 
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10 



25 



40 



Arg Tyr His Arg Ser Ser Leu Tyr Cys Ser Asp He Pro Set He His 
465 470 475 480 

Pro He Ser Glu Pro Lys Asp Cys Tyr Leu Gin Ser Asp Gly Phe Tyr 
485 490 495 

Glu Cys He Phe Gin Pro He Phe Leu Leu Ser Gly Tyr Thr Met Trp 
500 505 510 

He Arg He Asn His Ser Leu Gly Ser Leu Asp Ser Pro Pro Thr Cys 
515 520 525 



Val Leu Pro Asp Ser Val Val Lys Pro Leu Pro Pro Ser Ser Val Lys 
15 530 535 540 

Ala Glu He Thr He Asn He Gly Leu Leu Lys He Ser Trp Glu Lys 
545 550 555 560 

20 Pro Val Phe Pro Glu Asn Asn Leu Gin Phe Gin He Arg Tyr Gly Leu 

565 570 575 

Ser Gly Lys Glu Val Gin Trp Lys Met Tyr Glu Val Tyr Asp Ala Lys 
580 585 590 



Ser Lys Ser Val Ser Leu Pro Val Pro Asp Leu Cys Ala Val Tyr Ala 
595 600 605 



Val Gin Val Arg Cys Lys Arg Leu Asp Gly Leu Gly Tyr Trp Ser Asn 
30 610 615 620 

Trp Ser Asn Pro Ala Tyr Thr Val Val Met Asp He Lys Val Pro Met 

625 630 635 640 

35 Arg Gly Pro Glu Phe Trp Arg He He Asn Gly Asp Thr Met Lys Lys 

645 650 655 



Glu Lys Asn Val Thr Leu Leu Trp Lys Pro Leu Met Lys Asn Asp Ser 
660 665 670 

Leu Cys Ser Val Gin Arg Tyr Val He Asn His His Thr Ser Cys Asn 
675 680 685 



Gly Thr Trp Ser Glu Asp Val Gly Asn His Thr Lys Phe Thr Phe Leu 
45 690 695 700 

Trp Thr Glu Gin Ala His Thr Val Thr Val Leu Ala He Asn Ser He 

705 710 715 720 

50 Gly Ala Ser Val Ala Asn Phe Asn Leu Thr Phe Ser Trp Pro Met Ser 

725 730 735 
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Lys Val Asn lie Val Gin Ser Leu Ser Ala Tyr Pro Leu Asn Ser Ser 
740 745 750 

Cys Val lie Val Ser Trp lie Leu Ser Pro Ser Asp Tyr Lys Leu Met 
755 ' 760 765 

Tyr Phe lie lie Glu Trp Lys Asn Leu Asn Glu Asp Gly Glu lie Lys 
770 775 780 

Trp Leu Arg lie Ser Ser Ser Val Lys Lys Tyr Tyr lie His Asp His 
785 790 795 800 

Phe lie Pro He Glu Lys Tyr Gin Phe Ser Leu Tyr Pro He Phe Met 
805 810 815 

Glu Gly Val Gly Lys Pro Lys He He Asn Ser Phe Thr Gin Asp Asp 
820 825 830 

He Glu Lys His Gin Ser Asp 
835 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 624 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GCGGCCGCCA GTGTGATGGA TATCTGCAGA ATTCGGCTTT CTCTGCCTTC GGTCGAGTTG 60 

GACCCCCGGA TCAAGGTGTA CTTCTCTGAA GTAAGATGAT TTGTCAAAAA TTCTGTGTGG 120 

TTTTGTTACA TTGGGAATTT ATTTATGTGA TAACTGCGTT TAACTTGTCA TATCCAATTA 180 

CTCCTTGGAG ATTTAAGTTG TCTTGCATGC CACCAAATTC AACCTATGAC TACTTCCTTT 240 

TGCCTGCTGG GCTCTCAAAG AATACTTCAA ATTCGAATGG ACATTATGAG ACAGCTGTTG 300 

AACCTAAGTT TAATTCAAGT GGTACTCACT TTTCTAACTT ATCCAAAACA ACTTTCCACT 360 

GTTGCTTTCG GAGTGAGCAA GATAGAAACT GCTCCTTATG TGCAGACAAC ATTGAAGGAA 420 

AGACATTTGT TTCAACAGTA AATTCTTTAG TTTTTCAACA AATAGATGCA AACTGGAACA 480 

TACAGTGCTG GCTAAAAGGA GACTTAAAAT TATTCATCTG TTATGTGGAG TCATTATTTA 540 
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AGAATCTATT CAGGAATTAT AACTATAAGG TCCATCT TTT ATATGTTCTG CCTGAAGTGT 600 

TAGAAGATTC ACCTCTGGTT CCCCAAAAAG GCAGTTTTCA GATGGTTCAC TGCAATTGCA 660 

GTGTTCACGA ATGTTGTGAA TGTCTTGTGC CTGTGCCAAC AGCCAAACTC AACGACACTC 720 

TCCTTATGTG TTTGAAAATC ACATCTGGTG GAGTAATTTT CCAGTCACCT CTAATGTCAG 780 

TTCAGCCCAT AAATATGGTG AAGCCTGATC CACCATTAGG TTTGCATATG GAAATCACAG 840 

ATGATGGTAA TTTAAAGATT TCTTGGTCCA GCCCACCATT GGTACCATTT CCACTTCAAT 900 

ATCAAGTGAA ATATTCAGAG AATTCTACAA CAGTTATCAG AGAAGCTGAC AAGATTGTCT 960 

CAGCTACATC CCTGCTAGTA GACAGTATAC TTCCTGGGTC TTCGTATGAG GTTCAGGTGA 1020 

GGGGCAAGAG ACTGGATGGC CCAGGAATCT GGAGTGACTG GAGTACTCCT CGTGTCTTTA 1080 

CCACACAAGA TGTCATATAC TTTCCACCTA AAATTCTGAC AAGTGTTGGG TCTAATGTTT 1140 

CTTTTCACTG CATCTATAAG AAGGAAAACA AGATTGTTCC CTCAAAAGAG ATTGTTTGGT 1200 

GGATGAATTT AGCTGAGAAA ATTCCTCAAA GCCAGTATGA TGTTGTGAGT GATCATGTTA 1260 

GCAAAGTTAC TTTTTTCAAT CTGAATGAAA CCAAACCTCG AGGAAAGTTT ACCTATGATG 1320 

CAGTGTACTG CTGCAATGAA CATGAATGCC ATCATCGCTA TGCTGAATTA TATGTGATTG 1380 

ATGTCAATAT CAATATCTCA TGTGAAACTG ATGGGTACTT AACTAAAATG ACTTGCAGAT 1440 

GGTCAACCAG TACAATCCAG TCACTTGCGG AAAGCACTTT GCAATTGAGG TATCATAGGA 1500 

GCAGCCTTTA CTGTTCTGAT ATTCCATCTA TTCATCCCAT ATCTGAGCCC AAAGATTGCT 1560 

ATTTGCAGAG TGATGGTTTT TATGAATGCA TTTTCCAGCC AATCTTCCTA TTATCTGGCT 1620 

ACACAATGTG GATTAGGATC AATCACTCTC TAGGTTCACT TGACTCTCCA CCAACATGTG 1680 

TCCTTCCTGA TTCTGTGGTG AAGCCACTGC CTCCATCCAG TGTGAAAGCA GAAATTACTA 1740 

TAAACATTGG ATTATTGAAA ATATCTTGGG AAAAGCCAGT CTTTCCAGAG AATAACCTTC 1800 

AATTCCAGAT TCGCTATGGT TTAAGTGGAA AAGAAGTACA ATGGAAGATG TATGAGGTTT I860 

ATGATGCAAA ATCAAAATCT GTCAGTCTCC CAGTTCCAGA CTTGTGTGCA GTCTATGCTG 1920 

TTCAGGTGCG CTGTAAGAGG CTAGATGGAC TGGGATATTG GAGTAATTGG AGCAATCCAG 1980 

CCTACACAGT TGTCATGGAT ATAAAAGTTC CTATGAGAGG ACCTGAATTT TGGAGAATAA. 2040 

TTAATGGAGA TACTATGAAA AAGGAGAAAA ATGTCACTTT ACTTTGGAAG CCCCTGATGA 2100 
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AAAATGACTC ATTGTGCAGT GTTCAGAGAT ATGTGATAAA CCATCATACT TCCTGCAATG 2X60 

GAACATGGTC AGAAGATGTG GGAAATCACA CGAAATTCAC TTTCCTGTGG ACAGAGCAAG 2220 

CACATACTGT TACGGTTCTG GCCATCAATT CAATTGGTGC TTCTGTTGCA AATTTTAATT 2280 

TAACCTTTTC ATGGCCTATG AGCAAAGTAA ATATCGTGCA GTC AC TCAGT GCTTATCCTT 2340 

TAAACAGCAG TTGTGTGATT GTTTCCTGGA TACTATCACC CAGTGATTAC AAGCTAATGT 2400 

ATTTTATTAT TGAGTGGAAA AATCTTAATG AAGATGGTGA AATAAAATGG CTTAGAATCT 2460 

CTTCATCTGT TAAGAAGTAT TATATCCATG ATCATTTTAT CCCCATTGAG AAGTACCAGT 2520 

TCAGTCTTTA CCCAATATTT ATGGAAGGAG TGGGAAAACC AAAGATAATT AATAGTTTCA 2580 

CTCAAGATGA TATTGAAAAA CACCAGAGTG ATTGATAAGG ATCC 2624 
(2) INFORMATION FOR SEQ ID NO : 12 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2948 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



CCATTGAAGT 


CAATGGGAGT 


TTGTTTTGGC 


ACCAAAATCA 


ACGGGGATTT 


CCAAAATGTC 


60 


GTAATAACCC 


CGCCCCGTTG 


ACGCAAATGG 


GCGGTAGGCG 


TGTACGGTGG 


GAGGTCTATA 


120 


TAAGCAGAGC 


TCGTTTAGTG 


AACCGTCAGA 


TCTCTAGAAG 


CTGGGTACCA 


GCTGCTAGCA 


180 


AGCT TGCT AG 


CGGCCGCCAG 


TGTGATGGAT 


ATCTGCAGAA 


TTCGGCTTTC 


TCTGCCT TCG 


240 


GTCGAGTTGG 


ACCCCCGGAT 


CAAGGTGTAC 


TTCTCTGAAG 


TAAGATGATT 


TGTCAAAAAT 


300 


TCTGTGTGGT 


TTTGTTACAT 


TGGGAATTTA 


TTTATGTGAT 


AACTGCGTTT 


AACTTGTCAT 


360 


ATCCAATTAC 


TCCTTGGAGA 


TTTAAGTTGT 


CTTGCATGCC 


ACCAAATTCA 


ACCTATGACT 


420 


ACTTCCTTTT 


GCCTGCTGGG 


CTCTCAAAGA 


ATACTTCAAA 


TTCGAATGGA 


CATTATGAGA 


480 


CAGCTGTTGA 


ACCTAAGTTT 


AATTCAAGTG 


GTACTCACTT 


TTCTAACTTA 


TCCAAAACAA 


540 


CTTTCCACTG 


TTGCTTTCGG 


AGTGAGCAAG 


ATAGAAACTG 


CTCCTTATGT 


GCAGACAACA 


600 
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TTGAAGGAAA GACATTTGTT TCAACAGTAA 
ACTGGAACAT ACAGTGCTGG CTAAAAGGAG 
5 CATTATTTAA GAATCTATTC AGGAATTATA 
CTGAAGTGTT AGAAGATTCA CCTCTGGTTC 
GCAATTGCAG TGTTCACGAA TGTTGTGAAT 

10 

ACGACACTCT CCTTATGTGT TTGAAAATCA 
TAATGTCAGT TCAGCCCATA AATATGGTGA 
15 AAATCACAGA TGATGGTAAT TTAAAGATTT 
CACTTCAATA TCAAGTGAAA TATTCAGAGA 
AGATTGTCTC AGCTACATCC CTGCTAGTAG 

20 

TTCAGGTGAG GGGCAAGAGA CTGGATGGCC 
GTGTCTTTAC CACACAAGAT GTCATATACT 
25 CTAATGTTTC TTTTCACTGC ATCTATAAGA 
TTGTTTGGTG GATGAATTTA GCTGAGAAAA 
ATCATGTTAG CAAAGTTACT TTTTTCAATC 

30 

CCTATGATGC AGTGTACTGC TGCAATGAAC 
ATGTGATTGA TGTCAATATC AATATCTCAT 
35 CTTGCAGATG GTCAACCAGT ACAATCCAGT 
ATCATAGGAG CAGCCTTTAC TGTTCTGATA 
AAGATTGCTA TTTGCAGAGT GATGGTTTTT 

40 

TATCTGGCTA CACAATGTGG ATTAGGATCA 
CAACATGTGT CCTTCCTGAT TCTGTGGTGA 
45 AAATTACTAT AAACAT TGG A TTATTGAAAA 
ATAACCTTCA ATTCCAGATT CGCTATGGTT 
ATGAGGTTTA TGATGCAAAA TCAAAATCTG 

50 

TCTATGCTGT TCAGGTGCGC TGTAAGAGGC 
GCAATCCAGC CTACACAGTT GTCATGGATA 
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ATTCTTTAGT TTTTCAACAA ATAGATGCAA 660 

ACTTAAAATT ATTCATCTGT TATGTGGAGT 720 

ACTATAAGGT CCATCTTTTA TATGTTCTGC 780 

CCCAAAAAGG CAGTTTTCAG ATGGTTCACT 840 

GTCTTGTGCC TGTGCCAACA GCCAAACTCA 900 

CATCTGGTGG AGTAATTTTC CAGTCACCTC 960 

AGCCTGATCC ACCATTAGGT TTGCATATGG 1020 

CTTGGTCCAG CCCACCATTG GTACCATTTC 1080 

ATTCTACAAC AGTTATCAGA GAAGCTGACA 1140 

ACAGTATACT TCCTGGGTCT TCGTATGAGG 1200 

CAGGAATCTG GAGTGACTGG AGTACTCCTC 12 60 

TTCCACCTAA AATTCTGACA AGTGTTGGGT 1320 

AGGAAAACAA GATTGTTCCC TCAAAAGAGA 1380 
TTCCTCAAAG CCAGTATGAT GTTGTGAGTG - 1440 

TGAATGAAAC CAAACCTCGA GGAAAGTTTA 1500 

ATGAATGCCA TCATCGCTAT GCTGAATTAT 1560 

GTGAAACTGA TGGGTACTTA ACTAAAATGA 1620 

CACTTGCGGA AAGCACTTTG CAATTGAGGT 1680 

TTCCATCTAT TCATCCCATA TCTGAGCCCA 1740 

ATGAATGCAT TTTCCAGCCA ATCTTCCTAT 1800 

ATCACTCTCT AGGTTCACTT GACTCTCCAC 18 60 

AGCCACTGCC TCCATCCAGT GTGAAAGCAG 1920 

TATCTTGGGA AAAGCCAGTC TTTCCAGAGA 1980 
TAAGTGGAAA AGAAGTACAA TGGAAGATGT . 2040 

TCAGTCTCCC AGTTCCAGAC TTGTGTGCAG 2100 

TAGATGGACT GGGATATTGG AGTAATTGGA 2160 

TAAAAGTTCC TATGAGAGGA CCTGAATTTT 2220 
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35 



40 



45 



GGAGAATAAT 


TAATGGAGAT 


ACTATGAAAA 


AGGAGAAAAA 


TGTCACTTTA 


CTTTGGAAGC 


2280 


CCCTGATGAA 


AAATGACTCA 


TTGTGCAGTG 


TTCAGAGATA 


TGTGATAAAC 


CATCATACTT 


2340 


CCTGCAATGG 


AACATGGTCA 


GAAGATGTGG 


GAAATCACAC 


GAAATTCACT 


TTCCTGTGGA 


2400 


CAGAGCAAGC 


ACATACTGTT 


ACGGTTCTGG 


CCATCAATTC 


AATTGGTGCT 


TCTGTTGCAA 


2460 


ATTTTAATTT 


AACCTTTTCA 


TGGCCTATGA 


GCAAAGTAAA 


TATCGTGCAG 


TCACTCAGTG 


2520 


CTTATCCTTT 


AAACAGCAGT 


TGTGTGATTG 


TTTCCTGGAT 


ACTATCACCC 


AGTGATTACA 


2580 


AGCTAATGTA 


TTTTATTATT 


GAGTGGAAAA 


ATCTTAATGA 


AGATGGTGAA 


ATAAAATGGC 


2640 


TTAGAATCTC 


TTCATCTGTT 


AAGAAGTATT 


ATATCCATGA 


TCATTTTATC 


CCCATTGAGA 


2700 


AGTACCAGTT 


CAGTCTTTAC 


CCAATATTTA 


TGGAAGGAGT 


GGGAAAACCA 


AAGATAATTA 


2760 


ATAGT TTCAC 


TCAAGATGAT 


ATTGAAAAAC 


ACCAGAGTGA 


TGCAGGTGAC 


TACAAGGACG 


2820 


ACGATGACAA 


GTAGGGATCC 


AGACATGATA 


AGATACATTG 


ATGAGTTTGG 


ACAACCCACA 


2880 


ACTAGAATGC 


AGTGAAAAAA 


ATGCTTTATT 


TGTGAAATTT 


GTGATGCTAT 


TGCTTTATTT 


2940 


GTAACCAT 












2948 


(2) INFORMATION FOR SEQ ID NO: 13 











10 



15 



20 



25 



30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



Met lie Cys Gin Lys Phe Cys Val Val Leu Leu His Trp Glu Phe He 
15 10 15 

Tyr Val He Thr Ala Phe Asn Leu Ser Tyr Pro He Thr Pro Trp Arg 
20 25 30 

Phe Lys Leu Ser Cys Met Pro Pro Asn Ser Thr Tyr Asp Tyr Phe Leu 
50 35 40 45 



Leu Pro Ala Gly Leu Ser Lys Asn Thr Ser Asn Ser Asn Gly His Tyr 
50 55 60 
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10 



25 



40 



Glu Thr Ala Val Glu Pro Lys Phe Asn Ser Ser Gly Thr His Phe Ser 
65 70 75 80 

Asn Leu Ser Lys Thr Thr Phe His Cys Cys Phe Arg Ser Glu Gin Asp 
85 90 95 

Arg Asn Cys Ser Leu Cys Ala Asp Asn lie Glu Gly Lys Thr Phe Val 
100 105 110 

Ser Thr Val Asn Ser Leu Val Phe Gin Gin lie Asp Ala Asn Trp Asn 
115 120 125 



lie Gin Cys Trp Leu Lys Gly Asp Leu Lys Leu Phe lie Cys Tyr Val 
15 130 135 140 

Glu Ser Leu Phe Lys Asn Leu Phe Arg Asn Tyr Asn Tyr Lys Val His 
145 150 155 160 

20 Leu Leu Tyr Val Leu Pro Glu Val Leu Glu Asp Ser Pro Leu Val Pro 

165 170 175 



Gin Lys Gly Ser Phe Gin Met Val His Cys Asn Cys Ser Val His Glu 

180 185 190 

Cys Cys Glu Cys Leu Val Pro Val Pro Thr Ala Lys Leu Asn Asp Thr 

195 200 205 



Leu Leu Met Cys Leu Lys lie Thr Ser Gly Gly Val lie Phe Gin Ser 
30. 210 215 220 

Pro Leu Met Ser Val Gin Pro lie Asn Met Val Lys Pro Asp Pro Pro 
225 230 235 240 

35 Leu Gly Leu His Met Glu lie Thr Asp Asp Gly Asn Leu Lys lie Ser 

245 250 255 

Trp Ser Ser Pro Pro Leu Val Pro Phe Pro Leu Gin Tyr Gin Val Lys 
260 265 270 



Tyr Ser Glu Asn Ser Thr Thr Val lie Arg Glu Ala Asp Lys lie Val 
275 280 285 



Ser Ala Thr Ser Leu Leu Val Asp Ser lie Leu Pro Gly Ser Ser Tyr 
45 290 295 300 

Glu Val Gin Val Arg Gly Lys Arg Leu Asp Gly Pro Gly lie Trp Ser 
305 310 315 320 

50 Asp Trp Ser Thr Pro Arg Val Phe Thr Thr Gin Asp Val lie Tyr Phe 

325 330 335 
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Pro Pro Lys lie Leu Thr Ser Val Gly Ser Asn Val Ser Phe His Cys 
340 345 350 

lie Tyr Lys Lys Glu Asn Lys lie Val Pro Ser Lys Glu lie Val Trp 
355 360 365 

Trp Met Asn Leu Ala Glu Lys lie Pro Gin Ser Gin Tyr Asp Val Val 
370 375 380 

Ser Asp His Val Ser Lys Val Thr Phe Phe Asn Leu Asn Glu Thr Lys 
385 390 395 400 

Pro Arg Gly Lys Phe Thr Tyr Asp Ala Val Tyr Cys Cys Asn Glu His 
405 410 415 

Glu Cys His His Arg Tyr Ala Glu Leu Tyr Val lie Asp Val Asn lie 
420 425 430 

Asn lie Ser Cys Glu Thr Asp Gly Tyr Leu Thr Lys Met Thr Cys Arg 
435 440 445 

Trp Ser Thr Ser Thr lie Gin Ser Leu Ala Glu Ser Thr Leu Gin Leu 
450 455 460 

Arg Tyr His Arg Ser Ser Leu Tyr Cys Ser Asp lie Pro Ser lie His 
465 470 475 480 

Pro He Ser Glu Pro Lys Asp Cys Tyr Leu Gin Ser Asp Gly Phe Tyr 
485 490 . 495 

Glu Cys He Phe Gin Pro He Phe Leu Leu Ser Gly Tyr Thr Met Trp 
500 505 510 

He Arg He Asn His Ser Leu Gly Ser Leu Asp Ser Pro Pro Thr Cys 
515 520 , 525 

Val Leu Pro Asp Ser Val Val Lys Pro Leu Pro Pro Ser Ser Val Lys 
530 535 540 

Ala Glu He Thr He Asn He Gly Leu Leu Lys He Ser Trp Glu Lys 
545 550 555 560 

Pro Val Phe Pro Glu Asn Asn Leu Gin Phe Gin He Arg Tyr Gly Leu 
565 570 575 

Ser Gly Lys Glu Val Gin Trp Lys Met Tyr Glu Val Tyr Asp Ala Lys 
580 585 590 

Ser Lys Ser Val Ser Leu Pro Val Pro Asp Leu Cys Ala Val Tyr Ala 
595 600 €05 



Val Gin Val Arg Cys Lys Arg Leu Asp Gly Leu Gly Tyr Trp Ser Asn 
610 615 620 
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10 



25 



45 



Trp Ser Asn Pro Ala Tyr Thr Val Val Met Asp lie Lys Val Pro Met 
625 630 635 640 

Arg Gly Pro Glu Pfie Trp Arg lie lie Asn Gly Asp Thr Met Lys Lys 
645 650 655 

Glu Lys Asn Val Thr Leu Leu Trp Lys Pro Leu Met Lys Asn Asp Ser 
660 665 670 

Leu Cys Ser Val Gin Arg Tyr Val lie Asn His His Thr Ser Cys Asn 
675 680 685 



Gly Thr Trp Ser Glu Asp Val Gly Asn His Thr Lys Phe Thr Phe Leu 
15 690 695 700 

Trp Thr Glu Gin Ala His Thr Val Thr Val Leu Ala lie Asn Ser lie 
705 710 715 720 

20 Gly Ala Ser Val Ala Asn Phe Asn Leu Thr Phe Ser Trp Pro Met Ser 

725 730 735 

Lys Val Asn He Val Gin Ser Leu Ser Ala Tyr Pro Leu Asn Ser Ser 
740 745 750 



Cys Val He Val Ser Trp lie Leu Ser Pro Ser Asp Tyr Lys Leu Met 
755 760 765 



Tyr Phe He He Glu Trp Lys Asn Leu Asn Glu Asp Gly Glu He Lys 
30 770 775 780 

Trp Leu Arg He Ser Ser Ser Val Lys Lys Tyr Tyr He His Gly Lys 
785 790 795 800 

35 Phe Thr He Leu 



(2) INFORMATION FOR SEQ ID NO: 14: 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2507 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



50 



10 



20 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

GCGGCCGCCA GTGTGATGGA TATCTGCAGA ATTCGGCTTT CTCTGCCTTC GGTCGAGTTG 60 

GACCCCCGGA TCAAGGTGTa' CTTCTCTGAA GTAAGATGAT TTGTCAAAAA TTCTGTGTGG 120 

TTTTGTTACA TTGGGAATTT ATTTATGTGA TAACTGCGTT TAACTTGTCA TATCCAATTA 180 

CTCCTTGGAG ATTTAAGTTG TCTTGCATGC CACCAAATTC AACCTATGAC TACTTCCTTT 240 

TGCCTGCTGG GCTCTCAAAG AATACTTCAA ATTCGAATGG ACATTATGAG AC AGCTGT TG 300 

AACCTAAGTT TAATTCAAGT GGTACTCACT TTTCTAACTT ATCCAAAACA ACTTTCCACT 360 

15 GTTGCTTTCG GAGTGAGCAA GATAGAAACT GCTCCTTATG TGCAGACAAC ATTGAAGGAA 420 

AGACATTTGT TTCAACAGTA AATTCTTTAG TTTTTCAACA AATAGATGCA AACTGGAACA 4 80 

TACAGTGCTG GCTAAAAGGA GACTTAAAAT TATTCATCTG TTATGTGGAG TCATTATTTA 540 

AGAATCTATT CAGGAATTAT AACTATAAGG TCCATCTTTT ATATGTTCTG CCTGAAGTGT 600 

TAGAAGATTC ACCTCTGGTT CCCCAAAAAG GCAGTTTTCA GATGGTTCAC TGCAATTGCA 6 60 

25 GTGTTCACGA ATGTTGTGAA TGTCTTGTGC CTGTGCCAAC AGCCAAACTC AACGACACTC 720 

TCCTTATGTG TTTGAAAATC ACATCTGGTG GAGTAATTTT CCAGTCACCT CTAATGTCAG 780 

TTCAGCCCAT AAATATGGTG AAGCCTGATC CACCATTAGG TTTGCATATG GAAATCACAG 84 0 

ATGATGGTAA TTTAAAGATT TCTTGGTCCA GCCCACCATT GGTACCATTT CCACTTCAAT 900 

ATCAAGTGAA ATATTCAGAG AATTCTACAA CAGTTATCAG AGAAGCTGAC AAGATTGTCT 960 

35 CAGCTACATC CCTGCTAGTA GACAGTATAC TTCCTGGGTC TTCGTATGAG GTTCAGGTGA 1020 

GGGGCAAGAG ACTGGATGGC CCAGGAATCT GGAGTGACTG GAGTACTCCT CGTGTCTTTA 1080 

CCACACAAGA TGTCATATAC TTTCCACCTA AAATTCTGAC AAGTGTTGGG TCTAATGTTT 1140 

CTTTTCACTG CATCTATAAG AAGGAAAACA AGATTGTTCC CTCAAAAGAG ATTGTTTGGT 1200 

GGATGAATTT AGCTGAGAAA ATTCCTCAAA GCCAGTATGA TGTTGTGAGT GATCATGTTA 1260 

45 GCAAAGTTAC TTTTTTCAAT CTGAATGAAA CCAAACCTCG AGGAAAGTTT ACCTATGATG 1320 

CAGTGTACTG CTGCAATGAA CATGAATGCC ATCATCGCTA TGCTGAATTA TATGTGATTG 138 0 

ATGTCAATAT CAATATCTCA TGTGAAACTG ATGGGTACTT AACTAAAATG ACTTGCAGAT 1440 

GGTCAACCAG TACAATCCAG TCACTTGCGG AAAGCACTTT GCAATTGAGG TATCATAGGA 1500 

GCAGCCTTTA CTGTTCTGAT ATTCCATCTA TTCATCCCAT ATCTGAGCCC AAAGATTGCT 1560 



30 



40 



50 
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AT TTGCAGAG TGATGGTTTT TATGAATGCA TTTTCCAGCC AATCTTCCTA TTATCTGGCT 1620 
ACACAATGTG GATTAGGATC AATCACTCTC TAGGTTCACT TGACTCTCCA CCAACATGTG 1680 

5 

TCCTTCCTGA TTCTGTGGTG XAGCCACTGC CTCCATCCAG TGTGAAAGCA GAAATTACTA 1740 
TAAACATTGG ATTATTGAAA ATATCTTGGG AAAAGCCAGT CTTTCCAGAG AATAACCTTC 1800 
10 AATTCCAGAT TCGCTATGGT TTAAGTGGAA AAGAAGTACA ATGGAAGATG TATGAGGTTT 1860 
ATGATGCAAA ATCAAAATCT GTCAGTCTCC CAGTTCCAGA CTTGTGTGCA GTCTATGCTG 1920 
TTCAGGTGCG CTGTAAGAGG CTAGATGGAC TGGGATATTG GAGTAATTGG AGCAATCCAG 1980 

15 

CCTACACAGT TGTCATGGAT ATAAAAGTTC CTATGAGAGG ACCTGAATTT TGGAGAATAA 2040 
TTAATGGAGA TACTATGAAA AAGGAGAAAA ATGTCACTTT ACTTTGGAAG CCCCTGATGA 2100 
20 AAAAT GACTC ATTGTGCAGT GTTCAGAGAT ATGTGATAAA CCATCATACT TCCTGCAATG 2160 
GAACATGGTC AGAAGATGTG GGAAATCACA CGAAATTCAC TTTCCTGTGG AC AG AG C AAG 2220 
CACATACTGT TACGGTTCTG GCCATCAATT CAATTGGTGC TTCTGTTGCA AATTTTAATT 2280 

25 

TAACCTTTTC ATGGCCTATG AGCAAAGTAA ATATCGTGCA GTCACTCAGT GCTTATCCTT 2340 
TAAACAGCAG TTGTGTGATT GTTTCCTGGA TACTATCACC CAGTGATTAC AAGCTAATGT 2400 
30 ATTTTATTAT TGAGTGGAAA AATCTTAATG AAGATGGTGA AATAAAATGG CTTAGAATCT 24 60 

CTTCATCTGT TAAGAAGTAT TATATCCATG GTAAGTTTAC TATACTT 2507 
(2) INFORMATION FOR SEQ ID NO: 15: 

35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE : nucleic acid 

(C) STRAND EDNES S : single 
40 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GTAAGTTATT TGNNNNNATA TCCTAACAG 

50 
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(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 
GTAAGCATTA GCNNNNNTTT TAAATTCAG 
(2) INFORMATION FOR SEQ ID NO: 17: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
GTAAGTACCA AANNNNNTTT TCAATATAG 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
GTAAGTTATG CANNNNNTTT TTCCTTAAG 
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(2) INFORMATION FOR SEQ ID NO: 19: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
GTAAGTATAT TTNNNNAATA TTTAACAG 28 
(2) INFORMATION FOR SEQ ID NO: 20: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 Base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 
GTAGGTTATG TANNNNNCCC TCATTACAG 29 
35 (2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
50 GTAAGAAAAC AGNNNNNTGT TTCAAATAG 



29 
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(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

GTACGTATTA TTNNNNNTAT CTTTTAAAG 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
GTATGTCAAG CTNNNNNAAA AATTTCTAG 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
GTACCTTTTA CTNNNNNCTT ATTTTACAG 
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(2) INFORMATION FOR SEQ ID NO:25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 
GTCTGCAGAG ATNNNNNGTC ATTTTGCAG 29 
(2) INFORMATION FOR SEQ ID NO:26: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: CDNA 



30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
GTATTCCCAA TTNNNNNTAT TT ACT AC AG 29 
35 (2) INFORMATION FOR SEQ ID NO:27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 
40 (C> STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 
50 GTATTCCCAA TTNNNNNTAT TTACTACAG 
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(2) INFORMATION FOR SEQ ID NO:28: 

<i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 29 base pairs 
<B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: CDNA 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
GTAAGTTTAC TANNNNNTTT TCTCCTCAG 29 
(2) INFORMATION FOR SEQ ID NO: 29: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



30 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 9 : 

GTAAAAATTA TANNNNNTTT CTTTTTCAG 29 

35 (2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 29 base pairs 
<B) TYPE: nucleic acid 
40 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
50 GTATTGTACT TGNNNNNTAT CCTTTGTAG 



29 
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(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
GTTGCTTTTT CANNNNNTTA TCTAAACAG 29 
(2) INFORMATION FOR SEQ ID NO: 32: 

20 <i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



30 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
GTACATTTGT CTNNNNNCTT TTCTTTTAG 29 
35 (2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 
40 <C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
50 GTATCCAGTG TTNNNNNCTT TTTAAACAG 



29 
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CLAIMS 



10 



1. An OB receptor protein preparation 
containing an OB receptor protein, optionally in a 
pharmaceutical^ acceptable formulation, said OB 
rece ptor protein having part or all of the amino acid 
sequence according to Seq. ID No. 1 and one or more of 
the biological properties of naturally occurring OB 
receptor protein. 



2. An OB receptor protein preparation 
containing an OB receptor protein, optionally in a 
pharmaceutically acceptable formulation, wherein said OB 
15 receptor protein amino acid sequence is selected from 

among amino acid sequences (according to Seq. ID No. 1) : 

(a) 1-896; 

(b) 22-896 optionally with an N-terminal 
methionyl residue; 

20 (c) 23-896 optionally with an N-terminal 

methionyl residue; 

(d) 29-896 optionally with an N-terminal 

methionyl residue; 

(e) 1-839; 

25 (f) 22-839 optionally with an N-terminal 
methionyl residue; 

(g) 29-839 optionally with an N-terminal 

methionyl residue; 

(h) 1-841; 

30 (i) 22-841 optionally with an N-terminal 
methionyl residue; 

(j) 23-841 optionally with an N-terminal 

methionyl residue; 

(k) 29-841 optionally with an N-terminal 
35 methionyl residue; 

(1) 1-891; 
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(m) 22-891 optionally with an N-terminal 
methionyl residue; 

(n) 23-891 optionally with an N-terminal 
methionyl residue; 
5 (o) 29-891 optionally with an N-terminal 

methionyl residue; 

(p) of subparts (1) through (o) further 
having the C-terminal amino acids, beginning at position 
892, of OB receptor B (Seq. ID No. 3) or C (Seq. ID. No. 
10 5) ; and, 

(q) a chemically modified derivative of 
any of subparts (a) through (p) . 

3. An OB receptor protein preparation of 
15 claim 2 wherein said OB receptor protein is further 

selected from among the OB receptor proteins of subparts 
(1) through (o) further having the C-terminal amino 
acids, beginning at position 8 92, of OB receptor protein 
D (Seq. ID No. 7) . 

20 

4. An OB receptor protein preparation of 
claim 2 wherein said OB receptor protein is further 
selected from among the OB receptor proteins of subparts 
(1) through (o) further having substituted the C- 

25 terminal amino acids, beginning at position 799, G K F T 
I L (Seq. ID No. 13) . 
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5. An OB receptor protein preparation 
according to any of claims 1 through 4, wherein the 
extracellular domain of said OB receptor protein is 
5 modified, said modification selected from among: 

(a) deletion of all or part of the 
random coil domain; 

(b) modification of one or both ^WSXWS" 
boxes by substition of the first serine with another 

10 amino acid; 

(c) modification of one or both ^WSXWS" 
boxes by substitution of the last serine with another 
amino acid; and 

(d) modification of one or both "WSXWS" 
15 boxes by substitution of the first tryptophan with 

another amino acid. 

6. A DNA molecule encoding an OB receptor 
protein according to any of claims 1-5 selected from the 

20 group consisting of: 

(a) the DNA sequences set forth in Seq. 
ID nos. 2, 4, 6, 8, 11, 12, and 14; 

(b) a DNA which selectively hybridizes 
to a DNA of subpart (a) ; and 

25 (c) a DNA which, but for the degeneracy 

of the genetic code would hybridize to a DNA of subpart 
(a) or (b) . 

7. A biologically functional viral or 
30 plasmid vector containing a DNA of claim 6. 

8. A procaryotic or eucaryotic host cell 
containing the vector of claim 7 . 

35 9. A host cell modified so that expression 

of endogenous OB receptor protein is enhanced. 
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10. A host cell of claim 9 which is an 
isolated human host cell. 

5 11. A process for producing an OB receptor 

protein comprised of culturing, under suitable 
conditions, a host cell according to any of claims 8, 9 
or 10, obtaining the OB receptor produced, and 
optionally preparing a pharmaceutical composition 
10 containing said OB receptor. 

12. A method of treating an individual for a 
therapeutic disorder selected from among obesity, 
diabetes, high blood lipid levels, and high cholesterol 
15 levels comprised of administering a therapeutic amount 
of an OB receptor protein preparation containing an OB 
receptor protein according to any of claims 1-5, or 
produced by the process according to claim 11. 

20 13. A method of treating an individual for 

weight loss or weight maintenance for solely cosmetic 
purposes comprised of administering an effective amount 
of an OB receptor preparation containing an OB receptor 
protein according to any of claims 1-5, or produced by 

25 the process according to claim 11. 

14. Use of an OB receptor protein according 
to claims 1-5, or produced by the process of claim 11, 
for manufacturing a medicament for the treatment of 
30 obesity, diabetes, high blood lipid levels, or high 
cholesterol levels - 
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15. An OB protein/OB receptor protein complex 
preparation, .containing an OB protein moiety and an OB 
receptor protein moiety, optionally in a pharmaceuti- 

5 cally acceptable formulation, wherein: 

(a) said OB receptor protein is 
selected from among those set forth in any of claims 1 
and 2; 

(b) said OB protein moiety is selected 

10 from among: 

(i) a naturally ocurring OB 

protein; and, 

(ii) a non-naturally ocurring OB 
protein, analog or derivative thereof. 

15 

16. An OB protein/OB receptor protein complex 
preparation of claim 15 wherein said OB receptor protein 
is selected from among those set forth in any of claims 

3 , 4 , and 5 . 

20 

17. A method of treating an individual for a. 
therapeutic disorder selected from among obesity, 
diabetes, high blood lipid levels, and high cholesterol 
levels comprised of administering a therapeutic amount 

25 of an OB protein/OB receptor protein complex preparation 
of claims 15 or 16. 

18. A method of claim 17 wherein said OB 
protein/OB receptor protein complex preparation is 

30 formed in vivo by administering, into a patient, a first 
population of cells expressing an OB protein, and a 
second population of cells expressing an OB receptor 
protein- 
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19. A method of treating an individual for 
weight loss or weight maintenance for solely cosmetic 
purposes comprised of administering a therapeutic amount 
5 of an OB protein/OB receptor protein complex preparation 
containing an OB receptor protein moiety according to 
any of claims 1-5, or produced by the process according 
to claim 11. 

10 20. Use of an OB protein/OB receptor protein 

complex preparation, according to claims 15 or 16, for 
manufacturing a medicament for the treatment of obesity, 
diabetes, high blood lipid levels, or high cholesterol 
levels . 
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